ABSTRACT


The methods of modeling studied in the present work include fuzzy least square
regression, cluster wise fuzzy regression, the fuzzy auto regressive moving average (ARMA)
method, the Sugeno-type fuzzy system identification technique and the orthogonal parameter
estimator embedded with fuzzy discretization (ortho-clustering) technique. Fuzzy least
square regression and its four variations are developed. The effect of different modeling
functions upon the performance of the fuzzy regression is evaluated. Cluster wise
regression is used mainly to deal with the heterogeneity of the observed data. The effect
of different clustering algorithms upon the performance of the cluster wise regression is
evaluated. The simple fuzzy regression method is applied to the ARMA technique for
modeling a dynamical system. The Sugeno-type fuzzy identification technique is developed
to include the fuzzy reasoning and implications in modeling of the system. The identification
of premise parameters and consequence parameters is separated through the ortho-clustering
technique, and the effect of different clustering methods on its performance is evaluated.
The above algorithms are applied to the problems of estimation of life of converter
lining and modeling of Box Jenkins' gas furnace for evaluating their performance. A
comparison of performance of all the developed methods is brought out from the results
of the two test problems. The test problems are also modeled through neural networks
and the results are presented for comparison. Cluster wise fuzzy regression performs
best among the techniques for modeling a system having inherent imprecision and/or
very few data to describe the system, whereas for a simple system like Box Jenkins'
gas furnace cluster wise conventional regression has better performance.
Chapter 1


INTRODUCTION 

Modeling is the process of establishing a functional relationship among the dependent
and independent variables of the system. There are a number of techniques available for
system modeling, but their application depends on the nature of the system to be modeled.
For example, the auto regressive moving average technique is used in modeling a time series
or a dynamical system. For modeling a static system, conventional regression is used,
whereas probabilistic methods are used for a system including random errors. Similarly,
to model a system described by very few data and/or imprecise data, fuzzy system modeling
techniques are used. The major part of the thesis is devoted to evaluating the performance
of various fuzzified models for a given modeling technique. A comparison between
conventional and fuzzified methods has been presented. The measures of performance
considered are mean error, mean square error, root mean square error, standard
deviation of error, maximum error, minimum error and slope (ratio of predicted to actual),
where error is the difference between the actual value and the predicted value.
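
The performance measures listed above can be sketched as follows. This is only an illustration: the function name `performance_measures` is hypothetical, the standard deviation is taken as the population form, and the slope is assumed here to be the least-squares slope through the origin of predicted against actual values, since the text does not spell out how the ratio is computed.

```python
import math


def performance_measures(actual, predicted):
    """Error statistics of a fitted model, with error = actual - predicted."""
    errors = [a - p for a, p in zip(actual, predicted)]
    n = len(errors)
    mean_error = sum(errors) / n
    mse = sum(e * e for e in errors) / n
    return {
        "mean_error": mean_error,
        "mse": mse,
        "rmse": math.sqrt(mse),
        # Population standard deviation of the error (divides by n).
        "std_error": math.sqrt(sum((e - mean_error) ** 2 for e in errors) / n),
        "max_error": max(errors),
        "min_error": min(errors),
        # Assumption: slope = least-squares slope through the origin of
        # predicted values against actual values.
        "slope": sum(a * p for a, p in zip(actual, predicted))
        / sum(a * a for a in actual),
    }
```

A slope close to 1 together with small error statistics indicates that the predictions track the actual values closely.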

1.1 Brief review of modeling methods 

The present section discusses the advantages and shortcomings of the following methods
of modeling, whose performance is evaluated in the present thesis.
i. Fuzzy Least square regression modeling

In Diamond [1], several models are proposed as fuzzy analogues of simple linear
least squares, and fuzzy data are used to compute the parameters. Equations are derived
which are rather similar to the normal equations of classic least squares for a single input
and single output system. In this approach, the solution of the least square optimization
problem depends on the sign of the coefficient parameter of the model, and the proposed
method of finding the solution is very particular to the single input system, where only
two cases arise. The extension of the same to multi input systems needs heavy computation,
as the number of cases is 2^n, where n is the number of input variables.



ii. Cluster wise regression modeling 

Heterogeneity of observations is commonly encountered in practice in regression
analysis. To handle the problem of heterogeneity, the observations are clustered
using a suitable technique and then their membership values are used as weights in
the weighted least square estimation. This approach is proposed in Yang and Ko [2].
The disadvantage of this method is that the results heavily depend on the chosen clustering
algorithm.
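
The idea of using cluster memberships as regression weights can be sketched as below. This is a minimal illustration assuming NumPy; the names `weighted_least_squares` and `clusterwise_fit` are hypothetical, and the membership matrix is assumed to come from whichever clustering algorithm is chosen.

```python
import numpy as np


def weighted_least_squares(X, y, weights):
    """Fit y ~ [1, X] @ coef by minimizing sum_i w_i * residual_i**2."""
    X = np.asarray(X, dtype=float)
    y = np.asarray(y, dtype=float)
    design = np.column_stack([np.ones(len(X)), X])
    # Scaling rows by sqrt(w_i) turns the weighted problem into an ordinary one.
    root_w = np.sqrt(np.asarray(weights, dtype=float))
    coef, *_ = np.linalg.lstsq(design * root_w[:, None], y * root_w, rcond=None)
    return coef


def clusterwise_fit(X, y, memberships):
    """memberships has shape (n_samples, n_clusters); one model per cluster."""
    memberships = np.asarray(memberships, dtype=float)
    return [
        weighted_least_squares(X, y, memberships[:, k])
        for k in range(memberships.shape[1])
    ]
```

With crisp (0 or 1) memberships this reduces to fitting a separate regression on each cluster; with fuzzy memberships every observation contributes to every cluster model in proportion to its membership.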

iii. Auto regressive moving average (ARMA) modeling 

The ARMA class of models, developed by Box and Jenkins [3], has become one of the
most popular time series forecasting models. To simulate and predict a time series, it is
modeled as the output of a dynamic system whose input is white noise. Such a model
can be described in several ways, but if parsimonious parameterization is required then
the ARMA model is employed. The conventional ARMA model is unable to handle
imprecise and ambiguous data. This problem is dealt with in detail in this thesis.
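
As an illustration of a time series generated as the output of a dynamic system driven by white noise, the sketch below simulates a conventional ARMA process. The function name `simulate_arma` and the coefficient names `phi` (autoregressive part) and `theta` (moving average part) are assumptions made for this example only.

```python
import random


def simulate_arma(phi, theta, n_steps, noise_std=1.0, seed=0):
    """Simulate y[t] = sum_i phi[i]*y[t-1-i] + e[t] + sum_j theta[j]*e[t-1-j]."""
    rng = random.Random(seed)
    # White noise input of the dynamic system.
    e = [rng.gauss(0.0, noise_std) for _ in range(n_steps)]
    y = []
    for t in range(n_steps):
        # Terms reaching before the start of the series are treated as zero.
        ar = sum(phi[i] * y[t - 1 - i] for i in range(len(phi)) if t - 1 - i >= 0)
        ma = sum(theta[j] * e[t - 1 - j] for j in range(len(theta)) if t - 1 - j >= 0)
        y.append(ar + e[t] + ma)
    return y, e
```

For example, `simulate_arma([0.5], [0.3], 200)` produces 200 samples of an ARMA(1,1) process described by only two parameters, which is the sense in which the parameterization is parsimonious.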

iv. Sugeno-type Fuzzy Identification Method

The fuzzy model suggested by Takagi and Sugeno [4] in 1985 can represent or
model a general class of static or dynamic nonlinear systems. It is based on a “fuzzy
partition” of the input space and it can be viewed as the expansion of a piecewise linear
partition. It is represented as

$$R^i: \text{If } x_1 \text{ is } A_1^i \text{ and } x_2 \text{ is } A_2^i, \ldots, \text{ and } x_m \text{ is } A_m^i$$

$$\text{then } y^i = a_0^i + a_1^i x_1 + \cdots + a_m^i x_m \qquad (1.1)$$

$$w^i = \min_{j} A_j^i(x_j)$$

where $R^i$ (i = 1, 2, ..., c) denotes the i-th fuzzy rule, $x_j$ (j = 1, 2, ..., m) is the input and
$y^i$ is the output of the fuzzy rule $R^i$. For simplicity, a system with multi-input and single
output (MISO) is assumed. In the case of a multi-output system, several output variables
such as $y_1^i$ and $y_2^i$ are used. $A_1^i, A_2^i, \ldots, A_m^i$ are fuzzy variables with bell-shaped,
trapezoidal, triangular, or other membership functions representing a fuzzy subspace in
which the implication $R^i$ can be applied for reasoning. From Eq. 1.1 and Eq. 1.2, it is
noted that Takagi and Sugeno's fuzzy model approximates a nonlinear system with a
combination of several linear systems by fuzzily decomposing the whole input space into
several partial spaces and representing each input/output (I/O) space with a linear
equation. In [4], the identification of the fuzzy model described in Eq. 1.1 is carried out
iteratively in the following way: first, premise parameters are assumed, then consequent
parameters are optimally adjusted with respect to the given premise parameters, and then
the premise parameters are adjusted iteratively by the complex algorithm, a nonlinear
optimization method. However, implementing such an algorithm is not an easy exercise [5],
because the problem of determining the optimal membership variables involves a nonlinear
programming problem.
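
A minimal sketch of how such a rule base produces an output is given below. It assumes the usual Takagi-Sugeno inference, in which the model output is the firing-strength weighted average of the rule outputs; the triangular membership shape and the names `triangular` and `sugeno_output` are choices made for this illustration only.

```python
def triangular(left, peak, right):
    """Return a triangular membership function (an assumed shape)."""
    def mu(x):
        return max(min((x - left) / (peak - left), (right - x) / (right - peak)), 0.0)
    return mu


def sugeno_output(x, rules):
    """Evaluate a Sugeno-type model.

    rules is a list of (memberships, coeffs): memberships[j] is the membership
    function of the j-th input in that rule, and coeffs = [a_0, a_1, ..., a_m]
    are the coefficients of the linear consequent.
    """
    numerator = denominator = 0.0
    for memberships, coeffs in rules:
        # Firing strength: minimum of the premise memberships.
        w = min(mu(xj) for mu, xj in zip(memberships, x))
        # Linear consequent of the rule.
        y_rule = coeffs[0] + sum(a * xj for a, xj in zip(coeffs[1:], x))
        numerator += w * y_rule
        denominator += w
    if denominator == 0.0:
        raise ValueError("no rule fires for this input")
    # Assumption: output is the weighted average of the rule outputs.
    return numerator / denominator
```

For instance, with two single-input rules, "x is low then y = x" and "x is high then y = 2", the output moves smoothly from the first linear model to the second as x moves from the low region to the high region.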

v. Orthogonal parameter estimation technique (Ortho-Clustering Method)

The basic idea of the orthogonal parameter estimator [5] is to separate the premise
identification from the consequence identification, which are mutually related in the
Sugeno-type model. The separation of the premise identification from the consequence
identification is a significant advantage of this modeling approach, because it
can greatly simplify the process of building a Sugeno-type model. In this approach, the
premise of the model is first determined using a fuzzy discretization technique by
constructing reference fuzzy sets. This amounts to the partition of the input space. The
number of reference fuzzy sets determines the number of rules and the number of linear
equations in the consequent part of the model. The parameters of these linear equations
are then estimated using an orthogonal estimator.

For determining the reference sets, the data are classified using a suitable clustering
algorithm. The chosen clustering algorithm plays an important role in the division of the input
space, and hence in the performance of the estimator. The method proposed in [5] does not
evaluate the effect of the clustering method upon the performance of the model.

1.2 Description of clustering methods used 

This section presents a brief idea about the various clustering algorithms used in
cluster wise regression and orthogonal parameter estimation for evaluating the effect of
the classification method upon the performance of each model. A detailed explanation
of the implementation of these methods is given in [14].
i. Fuzzy c-means clustering 

This method uses concepts in n-dimensional Euclidean space to determine the geometric
closeness of data points by assigning them to various clusters or classes and then
determining the distance between the clusters. 
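
A compact sketch of fuzzy c-means is given below for orientation. It follows the commonly used alternating update of cluster centers and memberships with Euclidean distance, which is an assumption here since the implementation details are deferred to [14]; the function name `fuzzy_c_means` and the `fuzzifier` parameter are illustrative.

```python
import numpy as np


def fuzzy_c_means(data, n_clusters, fuzzifier=2.0, n_iter=100, tol=1e-6, seed=0):
    """Return (centers, memberships); memberships has shape (n_samples, n_clusters)."""
    data = np.asarray(data, dtype=float)
    rng = np.random.default_rng(seed)
    # Random initial memberships, normalized so that each row sums to 1.
    u = rng.random((len(data), n_clusters))
    u /= u.sum(axis=1, keepdims=True)
    for _ in range(n_iter):
        um = u ** fuzzifier
        # Centers are membership-weighted means of the data.
        centers = (um.T @ data) / um.sum(axis=0)[:, None]
        # Euclidean distance of every point to every center.
        dist = np.linalg.norm(data[:, None, :] - centers[None, :, :], axis=2)
        dist = np.fmax(dist, 1e-12)  # avoid division by zero
        inv = dist ** (-2.0 / (fuzzifier - 1.0))
        new_u = inv / inv.sum(axis=1, keepdims=True)
        change = np.max(np.abs(new_u - u))
        u = new_u
        if change < tol:
            break
    return centers, u
```

The membership matrix returned here is the kind of quantity that can serve as weights in cluster wise regression or as reference fuzzy sets in the ortho-clustering technique.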



ii. K-means clustering

The K-means clustering algorithm finds a set of cluster centers and partitions the training
data into subsets. For the sake of continuity, the subsets overlap to a limited extent.
Each cluster center is associated with one of the hidden units (h) in the network. The data
are partitioned such that the training points are assigned to the cluster with the nearest center.

iii. Self-Organizing Map (SOM)

The Kohonen network performs clustering through a competitive learning mechanism
called “winner takes all”. In essence, the node with the largest activation level is declared the
winner in the competition. This node is the only node that produces an output; all other
nodes are suppressed to the zero activation level. Furthermore, this node and its neighbors
are the only nodes permitted to learn for the current input pattern. After training, the
weight vector of each node encodes the information of a group of similar input patterns.
Given an input vector, it is assigned to the node with the maximum activation. Since the
number of nodes is fixed, the algorithm is similar to the K-means clustering algorithm.

iv. Adaptive resonance theory (ART2) 

ART2 is a widely used clustering technique for analog or continuous valued patterns.
The patterns are classified or clustered with the accuracy defined by a factor called the
“vigilance factor”. The ability of the network to generalize is limited; however, its ability
to create a new pattern class in its knowledge base on the arrival of a novel pattern makes
it very suitable for clustering. The classification depends on the order of presentation of
the input patterns.

v. Fuzzy ART 

Fuzzy ART can classify both binary and analog valued patterns. In this network
also, the clustering is mainly dependent on the vigilance factor, and the order of
presentation of input patterns also plays a role in classification. This is also an unsupervised
network in which one need not specify the number of clusters. This is determined by the
network itself depending upon the vigilance factor.
1.4 Problem statement 

The main emphasis of this work is to compare the performances of different modeling
techniques. Here we propose a new approach to the fuzzy least square method for multi input
systems which overcomes the disadvantage of Diamond's [1] method mentioned before.
The effect of modeling functions upon the performance of the model is studied. We
develop fuzzified models of ARMA and cluster wise regression by applying the fuzzified
1.5 Test System description

In the present work two problems are considered for evaluating the developed models,
viz. estimation of life of converter lining and modeling of Box Jenkins' gas furnace.

1.5.1 Estimation of life of converter lining problem

The life of a steel making converter lining can be expressed in terms of heat runs.
This number of heat runs can be considered to be a function of variables, namely
Hot metal weight HM C, Silicon in hot metal, Mn in hot metal, ...
The data set for these 30 variables is collected for 15 campaigns. Various dimension
reduction techniques have been applied and the experience of R&D of SAIL is also taken into
account to model the converter life with a reduced number of variables.
i. Estimation of life of converter lining problem (ICA)

Through ICA (independent component analysis) the following variables are
selected as inputs and life of converter lining in terms of heat runs as output. The
input variables selected are mean hot metal temperature, mean blow O2, mean tap to tap
temperature, mean lime added, mean ore, mean bath C, mean bath S, mean bath P, mean
basicity of slag, mean %FeO, mean %MgO, mean %CaO, mean %MnO. The data is
given in Table 1.1.

ii. Estimation of life of converter lining problem (PCA)

Through PCA (principal component analysis) the following 13 variables are selected
as inputs and life of converter lining in terms of heat runs as output. The input
variables selected are mean hot metal temperature, mean Si in hot metal, mean ... in hot
metal, mean blow O2, mean tap to tap temperature, mean tap to tap time, mean ... added,
mean ore, mean bath C, mean bath S, mean bath P, mean basicity of slag, mean %FeO,
mean %CaO. The data is given in Table 1.2.

iii. Estimation of life of converter lining problem (mean, R&D)

Through the experience of the RDCIS group, the following 13 variables are selected as
inputs and life of converter lining in terms of heat runs as output [14]. The input variables
selected are mean ratio of hot metal to (hot metal + scrap), mean Si in hot metal, %Si in
hot metal greater than or equal to 1.1, mean Mn in hot metal, mean blow O2, mean tap to tap
temperature greater than or equal to 1700, mean tap to tap time greater than or equal to 70
minutes, %presence of slag coating, mean lime added, %C in steel less than or equal to
0.05, %basicity of slag less than or equal to 2.5, %FeO greater than or equal to 22, mean
MgO in slag. The data is given in Table 1.3.

iv. Estimation of life of converter lining problem (median, R&D)

Through the experience of the RDCIS group, the following 11 variables are selected as
inputs and life of converter lining in terms of heat runs as output. The input variables selected
are median of ratio of hot metal to (hot metal + scrap), median Si in hot metal, median
Mn in hot metal, median blow O2, median tap to tap temperature, median tap to tap time,
median lime added, median %C in steel, median %basicity of slag, median %FeO,
median MgO in slag [14]. The data is given in Table 1.4.

In order to decide the model parameters so as to make the model convergent with
as many samples as possible, it was decided to train the model with 13 samples of input
output combinations. Testing was done with 2 samples of input combinations to predict
the life of the converter lining. The output was compared with the actual life of the
converter lining to evaluate the performance of the fitted model.

1.5.2 Modeling of Box Jenkins’ gas furnace data 

In the present work, Box and Jenkins' gas furnace data is used [3]. The data consist
of 296 I/O measurements of a gas furnace system: the input measurement u(k) is the gas
flow rate into the furnace and the output measurement y(k) is the CO2 concentration in the
outlet gas. The sampling interval is 9 seconds. For comparison with conventional fuzzy models,
u(k), u(k-1), u(k-2) and y(k-1), y(k-2), y(k-3) are chosen as the variables of the fuzzy model.
The data are given in Table 1.5.
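
The construction of the six model variables from the two measured series can be sketched as follows. The function name `build_regressors` is hypothetical, and the sequences `u` and `y` are assumed to hold the 296 input and output samples in time order.

```python
def build_regressors(u, y):
    """Build rows [u(k), u(k-1), u(k-2), y(k-1), y(k-2), y(k-3)] with target y(k)."""
    inputs, targets = [], []
    # The first usable index is k = 3, because y(k-3) must exist.
    for k in range(3, len(y)):
        inputs.append([u[k], u[k - 1], u[k - 2], y[k - 1], y[k - 2], y[k - 3]])
        targets.append(y[k])
    return inputs, targets
```

With 296 measurements this yields 293 input/output rows, since the first three samples are consumed by the lagged terms.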
1.6 Organization of thesis 

Fuzzy least square regression including its four variations and conventional least
square regression (variation 5), cluster wise regression and ARMA techniques are discussed
in chapter 2. Sugeno-type fuzzy system identification and orthogonal parameter estimation
(ortho-clustering) techniques are discussed in chapter 3. Results of each modeling
technique with respect to the two problems of estimation of life of converter lining and
modeling of Box Jenkins' gas furnace are discussed in the same section where the respective
modeling method is described in detail. A comparison of performance of each modeling
method is brought out in chapter 4. Finally, the conclusions and future scope of the work are
presented in chapter 5.



Chapter 2 


2.1 Fuzzy least square regression modeling

2.1.1 Introduction 

Regression analysis is used in evaluating the functional relationship between the
dependent and independent variables and in determining the best-fit model for describing
the relationship. In the usual conventional model the deviations between the observed
values and the estimated values are supposed to be due to measurement errors or random
variations. But sometimes the deviations are due to imprecise observed data or the
indefiniteness of the system structure. In this case the uncertainty is not due to randomness
but to fuzziness. Regression analysis on fuzzy data in dealing with fuzziness is usually
called Fuzzy Regression Analysis.

Tanaka et al. [6] first proposed this study in linear regression analysis with the
fuzzy model. They considered the linear fuzzy regression model $Y = A_1 x_1 + \cdots + A_p x_p$, where
the parameters $A_1, \ldots, A_p$ are triangular fuzzy numbers and the independent variables
$x_1, \ldots, x_p$ are real-valued numbers. Then they transformed the optimization problem
of estimation into a linear programming problem. Based on this approach, Tanaka
et al. [7], [8] and Ishibuchi et al. [9] continued research in this area. A generalization
of the Tanaka approach to the general form of regression equations with LR-type fuzzy
numbers is developed by Bardossy [10].

We note that the Tanaka approach is quite complicated in solving the optimization
problem. It is unclear what its relation is to a least-square concept. A measure of best
fit by residuals is not present in the Tanaka approach. Therefore, Diamond [1]
proposed the so-called fuzzy least squares. Based on a metric on the space F(R) of all
fuzzy numbers, Diamond gave a metric d on the space F(R) by

$$d^2[X_1, X_2] = (x_1 - x_2)^2 + \big(x_1 - x_2 - (\alpha_1 - \alpha_2)\big)^2 + \big(x_1 - x_2 + (\beta_1 - \beta_2)\big)^2$$



(1) Fuzzy input and fuzzy output

$$Y = a + bX \quad \text{where } a, b \in R,$$

and

$$Y = A + bX \quad \text{where } A, X \in F_T(R),$$

(2) Numerical input and fuzzy output

$$Y = A + Bx, \quad \text{where } x \in R,\; A, B \in F_T(R)$$

where $F_T(R)$ is a fuzzy space.

The corresponding least square optimization problems are

$$\text{minimize } r(a, b) = \sum_{i=1}^{m} d^2\big(Y_i,\; a + bX_i\big) \qquad (2.1)$$

$$\text{minimize } r(A, b) = \sum_{i=1}^{m} d^2\big(Y_i,\; A + bX_i\big) \qquad (2.2)$$

$$\text{minimize } r(A, B) = \sum_{i=1}^{m} d^2\big(Y_i,\; A + Bx_i\big) \qquad (2.3)$$

From Eq 2.1 and Eq 2.2 it can be clearly seen that the function to be minimized depends
on the sign of the coefficient parameter of each input of the model, thus giving rise to
different solutions for different cases. In Diamond [1] the proposed method of finding the
solution is very particular to the single input system, where only two cases arise. The
extension of the same to a multi input system needs heavy computation, as the number of
cases is 2^n, where n is the number of input variables.

The technique presented in the present work overcomes this problem for multi
input single output system modeling. In addition, the effect of different modeling
functions upon the performance of each model is studied. The developed models are applied
to the problems of estimation of life of converter lining and Box Jenkins' gas furnace
modeling and the results are discussed in section 2.1.5.

2.1.2 Formulation of Simple Fuzzy Least Squares problem 

The formulation of the method is based on the use of L-R fuzzy numbers (see Appendix)
and their very simple form of distance d[.,.], and reduces the problem to that of
finding the minimum of a classical function. The fuzzy numbers of type L-R constitute a
special class of fuzzy numbers very useful in estimation and other applications. Specifically,
a fuzzy L-R number M has a membership function of the type

$$\mu_M(x) = \begin{cases} L\big((m - x)/\alpha\big), & x \le m \\ R\big((x - m)/\beta\big), & x \ge m \end{cases}$$

where m is a classical number, $\alpha$ and $\beta$ are parameters, and L(.), R(.) are functions of a
special type (satisfying the condition of Def. A.1 in the Appendix). Symbolically an L-R fuzzy
number M is denoted by

$$M = (m, \alpha, \beta)$$

Let F(R) be the set of all fuzzy L-R numbers that are defined on R. On the set F(R) one
can define a linear structure and a norm $d[.,.]^2$, from which the distance between two fuzzy
L-R numbers $X_1 = (x_1, \alpha_1, \beta_1)$ and $X_2 = (x_2, \alpha_2, \beta_2)$ can be determined:

$$d^2[X_1, X_2] = (x_1 - x_2)^2 + \big(x_1 - x_2 - (\alpha_1 - \alpha_2)\big)^2 + \big(x_1 - x_2 + (\beta_1 - \beta_2)\big)^2$$

as given in [11]. Let Y be the output vector of m samples and $X_j$ the vector
of m samples of the j-th input variable, where n is the number of input variables. Considering
the three simple fuzzy regression models as in Diamond [1] and applying them to the case of a
multi input single output system gives rise to the following models

(1) Fuzzy input and fuzzy output

$$(F1):\; Y = a_0 + \sum_{j=1}^{n} a_j X_j \quad \text{where } a_0, a_j \in R,\; X_j \in F_T(R),\; j = 1, 2, \ldots, n$$

and

$$(F2):\; Y = A_0 + \sum_{j=1}^{n} a_j X_j \quad \text{where } a_j \in R,\; A_0, X_j \in F_T(R),\; j = 1, 2, \ldots, n$$

(2) Crisp input and fuzzy output :

Suppose that data pairs $x_{ij}$, $Y_i$, i = 1, 2, ..., m and j = 1, 2, ..., n are observed, where the real
numbers $x_{ij}$ are non-negative and each $Y_i$ is fuzzy. The model can be defined by

$$Y = A_0 + \sum_{j=1}^{n} A_j x_j, \quad \text{where } A_0, A_j \in F_T(R)$$
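
The distance between two L-R fuzzy numbers on which these models rely can be sketched as below. The sketch assumes the triangular form of Diamond's distance, which compares the centers, the left end points and the right end points; the class name `TriangularFuzzyNumber` and the function name `squared_distance` are illustrative only.

```python
from dataclasses import dataclass


@dataclass(frozen=True)
class TriangularFuzzyNumber:
    center: float  # m
    left: float    # left spread (alpha)
    right: float   # right spread (beta)


def squared_distance(a, b):
    """Squared distance between two triangular fuzzy numbers.

    Assumption: the distance is the sum of squared differences of the centers,
    of the left end points (center - left) and of the right end points
    (center + right).
    """
    d_center = a.center - b.center
    d_left = (a.center - a.left) - (b.center - b.left)
    d_right = (a.center + a.right) - (b.center + b.right)
    return d_center ** 2 + d_left ** 2 + d_right ** 2
```

For two crisp numbers (zero spreads) the three terms coincide, so the squared distance is three times the ordinary squared difference.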



2.1.3 Solution of fuzzy least square problem : Fuzzy input and fuzzy output

$$\text{variation 1: } (F1)\; Y = a_0 + \sum_{j=1}^{n} a_j X_j \quad \text{where } a_0, a_j \in R,\; X_j \in F_T(R),\; j = 1, 2, \ldots, n$$

and

$$\text{variation 2: } (F2)\; Y = A_0 + \sum_{j=1}^{n} a_j X_j \quad \text{where } a_j \in R,\; A_0, X_j \in F_T(R),\; j = 1, 2, \ldots, n$$

Each is to be fitted to the data in the sense of best fit with respect to the $d_{LR}$-metric.
Clearly (F2) mildly generalizes (F1).

In association with the model (F2), the problem is formulated as follows
variation 1: fuzzy parameters with fuzzy input

Solution of the fuzzy least square problem comprises essentially the following steps
Step ( i ) : Preprocess or transform ( scale ) the data.

Step ( ii ) : Fuzzify the input and output ( if necessary )

Step ( iii ) : Initialize the model parameters
Step ( iv ) : Initialize the $s_j$, $s'_j$ terms

Step ( v ) : Evaluate the classical function based on the $d_{LR}$ metric
Step ( vi ) : Get the simultaneous equations by partially differentiating the classical
function with respect to each parameter.

Step ( vii ) : Solve the simultaneous equations to get the model parameters
Step ( viii ) : Evaluate the stopping criterion

Step ( ix ) : Repeat the steps ( iii ) to ( viii ) till the criterion is satisfied

Details of each of the steps are as follows :

Step ( i ) Data Preprocessing 

Performance of the model is sensitive to the scaling of the data. So the data is
scaled between $X_{low}$ and $X_{high}$ with the formula

$$X_{scaled} = X_{low} + (X_{high} - X_{low}) \, \frac{X_{unscaled} - X_{min}}{X_{max} - X_{min}}$$

where $X_{min}$ and $X_{max}$ are the minimum and maximum of the data to be scaled, $X_{unscaled}$ is
the raw value and $X_{scaled}$ is the normalized value. From numerous experiments, it is suggested
that the input should be scaled between 0 and 1.0 to obtain better performance.
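
The scaling formula can be written directly as a small helper. The function name `scale` and its default range of 0 to 1.0 are choices made for this sketch.

```python
def scale(values, low=0.0, high=1.0):
    """Linearly map values so that their minimum becomes low and maximum becomes high."""
    v_min, v_max = min(values), max(values)
    if v_max == v_min:
        raise ValueError("constant data cannot be scaled")
    return [low + (high - low) * (v - v_min) / (v_max - v_min) for v in values]
```

For example, `scale([10, 20, 30])` maps the three values to 0.0, 0.5 and 1.0.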

Step ( ii ) Fuzzification of data ( if necessary ) 

If the data available is crisp then for the analysis, it is fuzzified using a suitable 
fuzzification method. The fuzzification method used in the thesis is given in Appendix. If 
the data is already in the fuzzified form then this step is avoided. 



Step ( iii ) Initialization of model parameters

With conventional regression analysis, the model parameters are initialized before 
actually analyzing through fuzzy regression modeling. 

Step ( iv ) Initialization of $s_j$, $s'_j$ terms
From the model defined as below

$$Y = A_0 + \sum_{j=1}^{n} a_j X_j \quad \text{where } a_j \in R,\; A_0, X_j \in F_T(R),\; j = 1, 2, \ldots, n$$

i.e.

$$Y_i = A_0 + \sum_{j=1}^{n} a_j X_{ij}, \quad i = 1, 2, \ldots, m.$$

The product of $a_j$ and $X_{ij}$ is defined differently for $a_j > 0$ and $a_j < 0$, so the formulation of
the least-squares optimization problem depends upon the sign of the coefficient parameter of
each input variable. To overcome the difficulty in evaluating the metric described in
section 2.1.1, a generalized approach is proposed by introducing new variables. Let $s_j$, $s'_j$
be two variables whose values are updated according to the sign of the coefficient $a_j$, where
j = 0, 1, ..., n, i.e.

$s_j = 1$ and $s'_j = -1$ if the sign of the j-th coefficient parameter is positive
$s_j = -1$ and $s'_j = 1$ if the sign of the j-th coefficient parameter is negative
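
The reason the sign of a coefficient matters can be seen from the product of a crisp number with an L-R fuzzy number, sketched below. The product rule shown is the standard one for L-R numbers and is an assumption here; the names `scale_fuzzy` and `sign_flags` are hypothetical, and a zero coefficient is treated as positive because the text leaves that case open.

```python
def scale_fuzzy(a, x, alpha, beta):
    """Product of a crisp scalar a with the L-R fuzzy number (x, alpha, beta).

    A negative scalar mirrors the number, so the left and right spreads
    exchange their roles.
    """
    if a >= 0:
        return (a * x, a * alpha, a * beta)
    return (a * x, -a * beta, -a * alpha)


def sign_flags(coefficients):
    """Return (s_j, s'_j) for each coefficient: (1, -1) if positive, (-1, 1) if negative."""
    # Assumption: a zero coefficient is treated as positive.
    return [(1, -1) if a >= 0 else (-1, 1) for a in coefficients]
```

For example, multiplying (3, 1, 0.5) by 2 gives (6, 2, 1), while multiplying it by -2 gives (-6, 1, 2): the spreads are swapped, which is why the flags must be updated whenever a coefficient changes sign.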

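Step ( v ) : Evaluation of classical function
The distance measure is evaluated as

(M1): minimize $r(a_j) = d_{LR}^2$, where $d_{LR}^2$ is given by

$$d_{LR}^2 = \sum_{i=1}^{m} d^2\Big(Y_i,\; A_0 + \sum_{j=1}^{n} a_j X_{ij}\Big)$$

Expanding each term with the distance defined in section 2.1.2 gives the classical
function $r(a_j)$.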

Step ( vi ) : Partial Differentiation of Classical Function

The parameters $a_j$, where j = 0, ..., n, can be determined by classical minimization of the real
valued function. Partially differentiating the classical function with respect to each
parameter of the model and equating the result to zero gives the conditions (I) to (IV), one
for each group of parameters. The resulting algebraic system consists of n+3 equations in
the n+3 unknowns: Eq. (I) results in n equations, one for each value of the index l, and
Eq. (IV) results in one equation (2.8).

Step ( vii ) Solution to the simultaneous equations 

These equations are solved in a straightforward way using matrix inversion technique and 
the parameters of the model are obtained. 

Step ( viii ) Evaluation of the stopping criterion 

The new parameters are compared with the old parameters and the process is stopped
when the new parameters are sufficiently close to the old parameters, i.e.
$\max_j |a_j(count) - a_j(count-1)| < \varepsilon$, where j = 0, 1, ..., n, count is the iteration number
and $\varepsilon$ is a sufficiently small positive real value.
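
The overall iteration of steps ( iii ) to ( viii ) can be sketched as follows for the model with crisp coefficients and a fuzzy constant. This is a sketch under stated assumptions, not the exact procedure of the thesis: the distance is taken as the sum of squared differences of centers, left end points and right end points, the simultaneous equations are solved with `numpy.linalg.lstsq` rather than by explicit matrix inversion, the spreads of the constant are not constrained to be non-negative, and the name `fit_fuzzy_input_model` is hypothetical.

```python
import numpy as np


def fit_fuzzy_input_model(x, alpha, beta, y, y_alpha, y_beta, eps=1e-8, max_iter=50):
    """Fit Y = A0 + sum_j a_j X_j with crisp a_j and fuzzy A0 = (c0, l0, r0).

    x, alpha, beta: (m, n) arrays of input centers, left and right spreads.
    y, y_alpha, y_beta: (m,) arrays of output centers, left and right spreads.
    Returns [c0, l0, r0, a_1, ..., a_n].
    """
    x, alpha, beta = (np.asarray(v, dtype=float) for v in (x, alpha, beta))
    y, y_alpha, y_beta = (np.asarray(v, dtype=float) for v in (y, y_alpha, y_beta))
    m = x.shape[0]
    ones, zeros = np.ones((m, 1)), np.zeros((m, 1))
    lower, upper = x - alpha, x + beta  # end points of the fuzzy inputs
    # Step (iii): initialize the coefficients by conventional regression on centers.
    init, *_ = np.linalg.lstsq(np.hstack([ones, x]), y, rcond=None)
    a, theta = init[1:], None
    for _ in range(max_iter):
        # Step (iv): the sign of each coefficient decides which input end point
        # contributes to which output end point.
        positive = a >= 0
        left_in = np.where(positive, lower, upper)
        right_in = np.where(positive, upper, lower)
        # Steps (v)-(vii): with fixed signs the objective is quadratic in the
        # unknowns [c0, l0, r0, a_1..a_n]; solve it as a least squares problem.
        design = np.vstack([
            np.hstack([ones, zeros, zeros, x]),        # centers
            np.hstack([ones, -ones, zeros, left_in]),  # left end points
            np.hstack([ones, zeros, ones, right_in]),  # right end points
        ])
        target = np.concatenate([y, y - y_alpha, y + y_beta])
        new_theta, *_ = np.linalg.lstsq(design, target, rcond=None)
        # Step (viii): stop when the parameters no longer change.
        converged = theta is not None and np.max(np.abs(new_theta - theta)) < eps
        theta, a = new_theta, new_theta[3:]
        if converged:
            break
    return theta
```

Fixing the signs before each solve is what avoids enumerating all 2^n sign combinations: each pass solves a single linear problem, and the signs are revised only if the new coefficients disagree with them.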

variation 2: crisp coefficient parameters, fuzzy constant parameter

Variation 2 can be treated as a special case of variation 1; thus (F1) mildly generalizes
(F2). In evaluating the parameters of model (F2) the constant parameter is treated
as a crisp value, and the resulting n+1 equations are solved for the n+1 unknown parameters.




2.1.4 Solution of fuzzy least square problem : Crisp input and fuzzy output 

variation 3: fuzzy coefficient parameters, fuzzy constant parameter
Solution of the fuzzy least square problem with crisp input and fuzzy model parameters
comprises essentially the following steps

Step ( i ) : Preprocess or transform ( scale ) the data.

Step ( ii ) : Fuzzify the output ( if necessary )

Step ( iii ) : Evaluate the classical function based on the $d_{LR}$ metric
Step ( iv ) : Get the simultaneous equations by partially differentiating the classical
function with respect to each parameter.

Step ( v ) : Solve the simultaneous equations to get the model parameters

Details of each of the steps are as follows :

Step ( iii ) : Evaluation of classical function

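Suppose that data pairs $x_{ij}$, $Y_i$, i = 1, 2, ..., m and j = 1, 2, ..., n are observed, where the real
numbers $x_{ij}$ are non-negative and each $Y_i$ is fuzzy. The model can be defined by

$$Y = A_0 + \sum_{j=1}^{n} A_j x_j, \quad \text{where } x_j \in R,\; A_0, A_j \in F_T(R)$$

i.e.

$$Y_i = A_0 + \sum_{j=1}^{n} A_j x_{ij}, \quad i = 1, 2, \ldots, m$$

is to be fitted to the data with respect to the best $d_{LR}$-fit.

Assuming $x_{i0} = 1.0$ for i = 1, 2, ..., m, one can write the model as

$$Y_i = \sum_{j=0}^{n} A_j x_{ij}, \quad i = 1, 2, \ldots, m$$

Considering the distance as defined earlier, the classical function to be minimized is
evaluated as

$$(MR):\; \text{minimize } r(A_j) = \sum_{i=1}^{m} d^2\Big(Y_i,\; \sum_{j=0}^{n} A_j x_{ij}\Big)$$

Assuming Y and A to be fuzzy LR numbers given by

$$Y_i = (y_i, \alpha_{y_i}, \beta_{y_i}), \quad i = 1, 2, \ldots, m$$

$$A_j = (a_j, \alpha_{a_j}, \beta_{a_j}), \quad j = 0, 1, 2, \ldots, n$$

Then the classical function can be expressed as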
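
For crisp non-negative inputs, a possible way to carry out the fit is sketched below. It assumes that the distance compares centers, left end points and right end points; under that assumption, and because every $x_{ij}$ is non-negative, the end points of the predicted output are linear in the parameters, so the objective separates into three ordinary least squares problems. The function name `fit_crisp_input_model` is hypothetical and the spreads are not constrained to be non-negative.

```python
import numpy as np


def fit_crisp_input_model(x, y, y_alpha, y_beta):
    """Fit Y_i = sum_j A_j x_ij (x_i0 = 1) with A_j = (a_j, alpha_j, beta_j), x_ij >= 0.

    x: (m, n) crisp inputs; y, y_alpha, y_beta: (m,) output centers and spreads.
    Returns (centers, left_spreads, right_spreads), each of length n + 1.
    """
    x = np.asarray(x, dtype=float)
    y, y_alpha, y_beta = (np.asarray(v, dtype=float) for v in (y, y_alpha, y_beta))
    if np.any(x < 0):
        raise ValueError("inputs must be non-negative")
    design = np.column_stack([np.ones(len(x)), x])  # x_i0 = 1.0

    def solve(target):
        return np.linalg.lstsq(design, target, rcond=None)[0]

    centers = solve(y)          # a_j
    lower = solve(y - y_alpha)  # a_j - alpha_j (left end points)
    upper = solve(y + y_beta)   # a_j + beta_j (right end points)
    return centers, centers - lower, upper - centers
```

The three solves correspond to the three groups of n+1 unknowns, which together make up the 3n+3 parameters of the model.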
Step ( iv ) : Partial Differentiation of Classical Function

The parameters $A_j$, where j = 0, ..., n, can be determined by classical minimization of the real
valued function. Partially differentiating the classical function with respect to each
parameter of the model gives

Parameter $a_l$ : $\partial r / \partial a_l = 0$, l = 0, 1, ..., n   (I)

Parameter $\alpha_{a_l}$ : $\partial r / \partial \alpha_{a_l} = 0$, l = 0, 1, ..., n   (II)

Parameter $\beta_{a_l}$ : $\partial r / \partial \beta_{a_l} = 0$, l = 0, 1, ..., n   (III)

By partial differentiation of the classical function and equating every derivative to zero, the
resulting algebraic system consists of 3n+3 equations in the 3n+3 unknowns. Eq. (I), Eq. (II)
and Eq. (III) each result in n+1 equations, one for each l = 0, 1, ..., n.

Step ( v ) Solution to the simultaneous equations

These equations are solved in a straightforward way using matrix inversion technique and 
the parameters of the model are obtained. 


variation 4: crisp coefficient parameters, fuzzy constant parameter
The variation 4 is defined as follows

$$\text{variation 4: } Y = A_0 + \sum_{j=1}^{n} a_j x_j, \quad \text{where } x_j, a_j \in R,\; A_0 \in F_T(R)$$

It can be clearly noticed that variation 4 is a special case of variation 3 with crisp
coefficient parameters for each input and a fuzzy constant parameter. Thus, applying the same
procedure for variation 4, n+3 equations are obtained from the classical function
and the n+1 parameters of the model are obtained by solving them using the matrix
inversion technique.

variation 5: crisp constant and coefficient parameters with crisp input
Variation 5 is defined as follows

variation 5: $Y = a_0 + \sum_{j=1}^{n} a_j x_j$, where $x_j, a_j, a_0 \in R$

Variation 5 is nothing but conventional regression analysis. It is treated as a special
case of fuzzy regression analysis for convenience in generalizing the fuzzy regression
analysis. The analysis of this variation is carried out in the conventional way.
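
As an illustration of variation 5, the following is a minimal sketch of conventional least squares regression, assuming NumPy is available. The function names and the demonstration data are hypothetical and are not taken from the present work; the normal equations are solved with a linear solver rather than an explicit matrix inverse, which gives the same parameters.

```python
import numpy as np

def fit_crisp_regression(X, y):
    """Least squares fit of y = a0 + sum_j a_j * x_j with all quantities crisp."""
    X = np.asarray(X, dtype=float)
    y = np.asarray(y, dtype=float)
    # Prepend a column of ones so that a0 is estimated with the other coefficients.
    design = np.column_stack([np.ones(len(X)), X])
    # Normal equations (D^T D) a = D^T y, solved without forming the inverse.
    return np.linalg.solve(design.T @ design, design.T @ y)

def predict(coeffs, X):
    """Evaluate the fitted model on new inputs."""
    X = np.asarray(X, dtype=float)
    return coeffs[0] + X @ coeffs[1:]

# Hypothetical noise-free data generated from y = 0.2 + 0.5*x1 - 0.3*x2.
X_demo = np.array([[0.0, 0.0], [1.0, 0.0], [0.0, 1.0], [1.0, 1.0], [0.5, 0.25]])
y_demo = 0.2 + 0.5 * X_demo[:, 0] - 0.3 * X_demo[:, 1]
coeffs_demo = fit_crisp_regression(X_demo, y_demo)
```

Because the demonstration data contains no noise, the fitted coefficients reproduce the generating values up to rounding; with real data the same call returns the best fit in the least squares sense.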
comparison of different modeling functions in simpl.regr* on estimation of
life of converter lining problem (median, R&D)




















2.1.5 Results and discussion

The models developed in the previous sections are applied to the problems of estimation
of life of converter lining and modeling of Box Jenkins' gas furnace. For all the
problems, the fuzzification constants are kept as follows

$\alpha_{in} = 0.9$    $\beta_{in} = 0.8$

$\alpha_{out} = 1.0$    $\beta_{out} = 1.3$

and the data is scaled between 0.0 and 1.0.
2.1.5.1 Results for estimation of life of converter lining

i. Results for estimation of life of converter lining (ICA)

From Table A it can be deduced that variation 1 and variation 2 have a larger training
error than variation 5 but a lower prediction error. For example, in Table A variation 2
using the tan hyperbolic (tanh) modeling function has a training error of 5.97 and a
prediction error of 0.36, whereas variation 5 with the same modeling function has a
training error of 2.53 but a much higher prediction error of 5.18. In Table A, for
variation 4 using the exponential modeling function, the training error is 2.47 and the
prediction error is 17.25. A typical comparison of modeling functions for variation 5 can
be seen in Fig 2.1 and Table 2.1. It can be seen that the sin and tanh modeling functions
give good results for all variations and that the exponential modeling function shows the
worst performance. A comparison of all variations for the sin modeling function can be
seen in Fig 2.2 and Table 2.2. It can be observed that for the sin modeling function
variation 5 shows better performance, with training error 2.02 and prediction error 2.4.

ii. Results for estimation of life of converter lining (median, R&D)

The results of the problem are given in Table B. From these results it can be observed
that the trend followed in this problem is the same as in the (ICA) problem, but the
effect of the modeling function has changed. For variations 1 and 2 the sin and tanh
modeling functions give good results, but for variations 3, 4 and 5 they give poor
results. For example, variation 1 using the sin modeling function has training error 7.61
and prediction error 16.78, whereas for variation 5 using the sin modeling function the
training error is 2.19 and the prediction error is 33.65. For variations 3, 4 and 5 the
exponential modeling function gives good results in prediction. A comparison of modeling
functions for variation 5 is given in Table 2.3 and Fig 2.3. From the graph it can be
observed that the effect of modeling functions on the training is low for all the variations.
Also the comparison of all variations for exponential modeling function can be observed



[Figure: mean error (in %). Table: training statistics and prediction statistics (mn error, ms error, rms error, error std, max error, min error, slope) by variation type.]


* simpl.regr means simple regression model

mn error = mean error, ms error = mean squared error, rms error = root mean squared error, error std = standard deviation of error,
max error = maximum error, min error = minimum error
shaded row represents the best performance














comparison of different modeling functions for simpl.regr* on problem of estimation
of life of converter lining (mean, R&D)

comparison of all variations in simpl.regr* on problem of estimation of life of
converter lining (mean, R&D)































comparison of different modeling functions for simpl.regr* on estimation of
life of converter lining problem (PCA)



































































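2. Fuzzy opposite

$-M = -(m, \alpha, \beta) = (-m, \beta, \alpha)$, an R-L number

3. Fuzzy subtraction

Subtraction is meaningful only between L-R and R-L, L-L and L-L, and R-R and R-R
numbers (not between R-L and R-L numbers)

$M - N = M + (-N)$

4. Fuzzy inverse

$1/M = (1/m, \beta/m^2, \alpha/m^2)$, an R-L number

5. Fuzzy multiplication

Three cases are distinguished:

Case A: If m>0 and n>0, then

$MN = (m, \alpha, \beta)(n, \gamma, \delta) = (mn,\; m\gamma + n\alpha,\; m\delta + n\beta)$

Case B: If m<0 and n>0, then

$MN = (m, \alpha, \beta)(n, \gamma, \delta) = (mn,\; n\alpha - m\delta,\; n\beta - m\gamma) = -[(-(m, \alpha, \beta))(n, \gamma, \delta)]$

Case C: If m<0 and n<0, then

$MN = (m, \alpha, \beta)(n, \gamma, \delta) = (mn,\; -n\beta - m\delta,\; -n\alpha - m\gamma) = (-(m, \alpha, \beta))(-(n, \gamma, \delta))$

6. Fuzzy division

$M/N = M \cdot (1/N) = (m, \alpha, \beta)(1/n, \delta/n^2, \gamma/n^2) = (m/n,\; (m\delta + n\alpha)/n^2,\; (m\gamma + n\beta)/n^2)$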
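
The operations listed above can be sketched in code. The following is a minimal illustration, not the implementation used in the present work: the class name `LRNumber` is hypothetical, the L-R versus R-L type of each number is not tracked, addition is taken as the usual component-wise sum of means and spreads, and multiplication covers only the case of two positive means using the standard approximate product.

```python
from dataclasses import dataclass

@dataclass(frozen=True)
class LRNumber:
    m: float      # mean (modal) value
    alpha: float  # left spread
    beta: float   # right spread

    def __neg__(self):
        # Opposite: the sign of the mean flips and the spreads swap sides.
        return LRNumber(-self.m, self.beta, self.alpha)

    def __add__(self, other):
        # Addition: means and corresponding spreads add.
        return LRNumber(self.m + other.m,
                        self.alpha + other.alpha,
                        self.beta + other.beta)

    def __sub__(self, other):
        # Subtraction is defined through the opposite: M - N = M + (-N).
        return self + (-other)

    def __mul__(self, other):
        # Approximate product for two positive means only.
        if self.m <= 0 or other.m <= 0:
            raise ValueError("this sketch covers only m > 0 and n > 0")
        return LRNumber(self.m * other.m,
                        self.m * other.alpha + other.m * self.alpha,
                        self.m * other.beta + other.m * self.beta)

# Hypothetical example values.
M_demo = LRNumber(2.0, 0.5, 1.0)
N_demo = LRNumber(3.0, 1.0, 0.5)
difference_demo = M_demo - N_demo
product_demo = M_demo * N_demo
```

Keeping the number immutable (`frozen=True`) makes each operation return a new value, which mirrors the way the formulas above define a result rather than modify an operand.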

A.2 Method of Fuzzification

The method of fuzzification used in the present work is a simple one: the standard
deviation of all the samples of each variable is used as the spread of the fuzzy number
of that variable. The spread can be multiplied by any numeric constant in order to weight
it. The fuzzified number of any variable x is given as $X = (x, \alpha\sigma, \beta\sigma)$,
where $\alpha$ and $\beta$ are constants and $\sigma$ is the standard deviation of x in
the observed samples.

A.3 Method of Defuzzification

The crisp value of a fuzzy number is evaluated by defuzzifying the corresponding
fuzzy number with a suitable defuzzification technique. The present work applies the
standard center of gravity method to defuzzify the fuzzy number.
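
The fuzzification and defuzzification steps can be sketched as follows. This is an illustrative sketch under stated assumptions rather than the code of the present work: the function names are hypothetical, the population standard deviation is used (the text does not say which estimator was applied), and the center of gravity is computed for a triangular membership function, which is an assumption about the shape of the L and R reference functions.

```python
import statistics

def fuzzify(samples, alpha, beta):
    """Turn crisp samples of one variable into (mean, left spread, right spread) triples."""
    # Assumption: population standard deviation of the observed samples.
    sigma = statistics.pstdev(samples)
    return [(x, alpha * sigma, beta * sigma) for x in samples]

def defuzzify_centroid(number):
    """Center of gravity of a triangular number with support [m - left, m + right]."""
    m, left, right = number
    # Centroid of a triangle is the average of its three vertices on the x axis.
    return m + (right - left) / 3.0

# Hypothetical scaled samples with alpha = 0.9 and beta = 0.8.
samples_demo = [0.2, 0.4, 0.6, 0.8]
fuzzy_demo = fuzzify(samples_demo, alpha=0.9, beta=0.8)
crisp_demo = [defuzzify_centroid(f) for f in fuzzy_demo]
```

With a left spread larger than the right spread, the center of gravity lies slightly below the mean value; with equal spreads it coincides with the mean.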





comparison of modeling functions in simpl.regr* on problem of modeling gas furnace











































comparison of all variations in simpl.regr* on problem of modeling of gas furnace





























in Fig 2.4 and Table 2.4, where variation 5 shows better performance for the exponential
modeling function.

iii. Results for estimation of life of converter lining (mean, R&D)

The results are tabulated in Table C. From these tables it is observed that the trend
of variations 1 and 2 having poorer training but better prediction is retained. Modeling
functions have considerable effect on both training and prediction. A typical comparison of
modeling functions for variation 1 can be seen in Fig 2.5 and Table 2.5. From the graph
it can be seen that the exponential modeling function reduces the training error but
increases the prediction error, i.e. the training error is 1.67 whereas the prediction
error is 26.59. A comparison of all variations for the polynomial modeling function can be
seen in Fig 2.6 and Table 2.6. From the graph it can be observed that variation 1 shows
better performance than the other variations. For variation 1 the training and prediction
errors are 6.65 and 16.79, whereas for variation 5 the training error is 1.6 and the
prediction error is 27.42.

iv. Results for estimation of life of converter lining (PCA)

The results are tabulated in Table D. From these tables it is again observed that
the trend of variation 1 having poorer training but better prediction is retained. Modeling
functions have considerable effect on both training and prediction. A typical comparison
of modeling functions for variation 3 can be seen in Fig 2.7 and Table 2.7. From the
graph it can be observed that the polynomial modeling function reduces the training error
but increases the prediction error, i.e. the training error is 2.35 whereas the prediction
error is 55.1, while for the tanh modeling function the training error is 2.43 and the
prediction error is 33.57. A comparison of all variations for the tanh modeling function
can be seen in Fig 2.8 and Table 2.8. From the graph it can be observed that variation 3
shows better performance than the other variations. For variation 3 the training and
prediction errors are 2.43 and 33.57, whereas for variation 5 the training error is 1.08
and the prediction error is 37.85.
2.1.5.2 Results for modeling of Box Jenkins' gas furnace problem

From the results tabulated in Table E one can observe that conventional regression,
i.e. variation 5, yields good results when compared to the other variations. The minimum
training and prediction errors of variation 5 are 0.48 and 0.48 respectively, whereas in
variation 1 the training and prediction errors are 1.19 and 1.12 respectively. The effect
of the modeling function can be observed in Fig 2.9 and Table 2.9. For this problem the
simple polynomial modeling function gives better results. The effect of variation upon this
problem can be seen in Fig 2.10 and Table 2.10 for the polynomial modeling function.



2.1.6 Conclusions


In the present work, fuzzified least square regression modeling is developed. The
effect of various modeling functions is studied for different variations of fuzzy
regression models. Fuzzifying the input leads to a poorer training but a better prediction,
i.e. variations 1 and 2 always show a better prediction though the training is worse. The
effect of modeling functions upon performance is totally dependent upon the input variables.
The performance of each modeling function with every variation is almost constant. With
Box data the effect of the modeling function is not considerable, as the variation in the
data is very low.

2.2 A.R.M.A.

2.2.1 Introduction

The auto regressive moving average (ARMA) class of models [ 3 ] has become one of the most
popular time series forecasting models. To simulate and predict a time series
quantitatively, one models it as the output of a dynamic system whose input is white
noise. Such a model can be described in several ways, but if a parsimonious parameterization
is required then the ARMA model is employed. Suppose a time series Y(t), t = 0, ±1, ..., is
considered. It is modeled as the output of a system whose input is a white noise
$\varepsilon(t)$, employing ARMA(p,q) with the representation

$Y(t) + \alpha(1) Y(t-1) + \cdots + \alpha(p) Y(t-p) = \varepsilon(t) + \beta(1)\varepsilon(t-1) + \cdots + \beta(q)\varepsilon(t-q)$

or in operator notation

$g(L) Y(t) = h(L) \varepsilon(t)$

where

L is the lag (or backward shift) operator, $L Y(t) = Y(t-1)$,

$g(z) = 1 + \alpha(1) z + \cdots + \alpha(p) z^{p}$

$h(z) = 1 + \beta(1) z + \cdots + \beta(q) z^{q}$

$\sigma^2 = E[|\varepsilon(t)|^2]$ is the variance of the white noise $\varepsilon(t)$.

For this model to represent a stationary time series, the roots of the characteristic
equation g(z) = 0 must lie outside the unit circle in the complex plane. An ARMA(p,q)
model for a stationary time series has parameters

$\alpha(1), \ldots, \alpha(p), \beta(1), \ldots, \beta(q)$

and the recursive least squares method is applied to estimate these parameters.
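
The stationarity condition stated above can be checked numerically. The sketch below assumes NumPy and uses a hypothetical helper name; it takes the autoregressive coefficients in the sign convention of g(z) = 1 + a(1) z + ... + a(p) z^p and tests whether every root of g(z) = 0 lies strictly outside the unit circle.

```python
import numpy as np

def is_stationary(ar_coeffs):
    """True if all roots of g(z) = 1 + a(1) z + ... + a(p) z^p lie outside the unit circle."""
    a = np.asarray(ar_coeffs, dtype=float)
    # np.roots expects coefficients ordered from the highest power to the constant term.
    poly = np.concatenate([a[::-1], [1.0]])
    roots = np.roots(poly)
    return bool(np.all(np.abs(roots) > 1.0))

# Y(t) - 0.5 Y(t-1) = e(t): g(z) = 1 - 0.5 z has its root at z = 2.
stationary_demo = is_stationary([-0.5])
# Y(t) - 1.2 Y(t-1) = e(t): g(z) = 1 - 1.2 z has its root inside the unit circle.
explosive_demo = is_stationary([-1.2])
```

Roots that fall numerically on the unit circle are treated here as non-stationary, since the test uses a strict inequality.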



The present work extends the ARMA model to the identification of a multi input
single output system as suggested in [ 12 ]. It presents a way to fuzzify the developed
ARMA models using all the variations developed in the fuzzy regression analysis. A
comparison of performance for conventional and fuzzified ARMA is provided with two
example problems, estimation of life of converter lining and modeling of Box Jenkins' gas
furnace.

2.2.2 Formulation of fuzzy ARMA

Formulation of fuzzy ARMA comprises essentially the following steps
Step ( i ) : Preprocess or scale the data ( as described previously )

Step ( ii ) : Formulate [ Y(N) ], [ $\Phi$(N) ] and [ $\theta$ ] from the observed ( available ) data
Step ( iii ) : Fuzzify [ Y(N) ] and [ $\Phi$(N) ] ( if necessary ) ( as described previously )

Step ( iv ) : Evaluate the model parameters by fuzzy least square regression modeling
Details of each step are given below.

Step ( ii ) : Formulation of [ Y(N) ], [ $\Phi$(N) ] and [ $\theta$ ]

Let Y be the output vector of m samples and Xj, j = 1, ..., n, be the vector of m samples
of the j-th input variable, where n is the number of input variables. Extending the
concept of stationary time series modeling to a multi input single output system as
suggested by [12], a simple ARMA(p,q) model is considered for the system described above.
Assuming p=q, the model can be written as

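$y(k) + \sum_{i=1}^{p} a_i\, y(k-i) = \sum_{j=1}^{n} \sum_{i=1}^{p} b_{ij}\, x_j(k-i)$

where p is the order of the model and
$a_i$, $b_{ij}$ are the parameters describing the system.

Let $\theta = [a_1, \ldots, a_p, b_{11}, \ldots, b_{pn}]^T$

and $\phi(k) = [-y(k-1), -y(k-2), \ldots, x_1(k-1), \ldots, x_n(k-p)]^T$,  k = p, p+1, ..., N


$\Phi(N) = [\phi(p), \ldots, \phi(N)]^T$

$Y(N) = [y(p), \ldots, y(N)]^T$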


Step ( iv ) : Evaluation of model parameters

Formulating the ARMA model in the same way as the fuzzy regression analysis, the
data is fitted in the sense of best fit with respect to the metric described in section
2.1, where the corresponding least square model is represented as



comparison of different modeling functions in ARMA on estimation of life of
converter lining problem (ICA)










































comparison of all variations in ARMA on estimation of
life of converter lining problem (ICA)







































comparison of different modeling functions in ARMA on estimation of life of
converter lining problem (median, R&D)



comparison of all variations in ARMA on the problem of estimation of
life of converter lining (median, R&D)









































$y(k) = \sum_{j=1}^{p(n+1)} \theta_j\, \phi_j(k)$

Models with all the variations and different modeling functions are developed.
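
For the crisp case only, the construction of the regressor vectors and the least squares estimate of the parameter vector can be sketched as below. This is a hedged illustration, not the procedure of the present work: the function names are hypothetical, NumPy's batch least squares solver is used, the time index starts at zero, and the parameters of each input are grouped by input and then by lag, which is an assumed ordering.

```python
import numpy as np

def build_regressors(y, X, p):
    """Stack phi(k) = [-y(k-1..k-p), x_1(k-1..k-p), ..., x_n(k-1..k-p)] for k = p..N-1."""
    y = np.asarray(y, dtype=float)
    X = np.asarray(X, dtype=float)  # shape (N, n): one column per input variable
    rows, targets = [], []
    for k in range(p, len(y)):
        past_y = [-y[k - i] for i in range(1, p + 1)]
        past_x = [X[k - i, j] for j in range(X.shape[1]) for i in range(1, p + 1)]
        rows.append(past_y + past_x)
        targets.append(y[k])
    return np.array(rows), np.array(targets)

def estimate_theta(y, X, p):
    """Least squares estimate of the p(n+1) parameters in y(k) = sum_j theta_j * phi_j(k)."""
    Phi, Y = build_regressors(y, X, p)
    theta, *_ = np.linalg.lstsq(Phi, Y, rcond=None)
    return theta

# Hypothetical first-order system: y(k) - 0.5 y(k-1) = 0.8 x(k-1), so a_1 = -0.5, b_11 = 0.8.
rng = np.random.default_rng(0)
x_demo = rng.uniform(size=(60, 1))
y_demo = np.zeros(60)
for k in range(1, 60):
    y_demo[k] = 0.5 * y_demo[k - 1] + 0.8 * x_demo[k - 1, 0]
theta_demo = estimate_theta(y_demo, x_demo, p=1)
```

Since the demonstration data is generated without noise, the estimate recovers the generating parameters up to rounding. The fuzzy variations would replace the least squares solve with the fuzzy least square procedure of section 2.1.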

2.2.3 Results and discussion

The models developed in ARMA are applied to the problems of estimation of life
of converter lining and modeling of Box Jenkins' gas furnace. For all the problems, the
fuzzification constants are kept as follows

$\alpha_{in} = 0.9$    $\beta_{in} = 0.8$

$\alpha_{out} = 1.0$    $\beta_{out} = 1.3$

The data is scaled between 0.0 and 1.0 and the optimum order of the model is found to be 1.
2.2.3.1 Results of estimation of life of converter lining problem

i. Results of estimation of life of converter lining problem ( ICA)

From the results given in Table A, one can observe that variation 5 with the sin
modeling function gives better results: the training error is 0.39 and the prediction
error is 2.58. Variation 2 gives a better prediction, with prediction error 1.43, though
its training error is 5.37. For all variations the exponential modeling function gives the
best training but a poor prediction; for example, variation 5 has a training error of 0.04
but a prediction error of 13.93. This can be observed in Fig 2.11 and Table 2.11, where a
comparison of all modeling functions is brought out. From Fig 2.12 and Table 2.12 a
comparison of all the variations can be observed for the sin modeling function. Variation 5
gives better training and prediction results, with training error 0.39 and prediction
error 2.58.

ii. Results of estimation of life of converter lining problem ( median, R&D)

From the results given in Table B, it can be observed that variation 5 with the
exponential modeling function gives better results: the training error is 0.1 and the
prediction error is 5.22. Variation 4 gives a better prediction, with prediction error
4.63, though its training error is 2.37. For all variations the tanh modeling function
gives poor training and prediction; for example, variation 5 has a training error of 2.24
and a prediction error of 40.44. This can be observed in Fig 2.13 and Table 2.13, where a
comparison of all the modeling functions shows that the exponential and polynomial
modeling functions perform well. From Fig 2.14 and Table 2.14 a comparison of all the
variations can be observed.



comparison of all variations in ARMA on estimation of life of converter lining
problem (mean, R&D)









































comparison of different modeling functions in ARMA on problem of estimation
of life of converter lining (mean, R&D)

converter lining problem (PCA)





































comparison of different modeling functions in ARMA on gas furnace problem








































comparison of all variations in ARMA on gas furnace problem

iii. Results of estimation of life of converter lining problem (mean, R&D)

The results obtained are given in Table C. From the results it can be deduced
that variation 2 with the sin modeling function gives good results: the training error is …
and the prediction error is 19.93. In this problem variation 5 has better training but
poor prediction, i.e. the training error is 0.95 whereas the prediction error is 45.0. This
can be observed from Fig 2.15 and Table 2.15, where a comparison is brought out for all
variations with the sin modeling function. A typical comparison of the performance of all
modeling functions is shown in Fig 2.16 and Table 2.16. It can be observed from the graph
that the sin and tanh modeling functions show better performance.

iv. Results of estimation of life of converter lining problem (PCA)

The results obtained are given in Table D. From the results it can be deduced
that variation 2 with the exponential modeling function gives good results: the training
error is 1.6 and the prediction error is 2.0. In this problem variation 5 has better
training but poor prediction, i.e. the training error is 0.05 whereas the prediction error
is 7.48. This can be observed from Fig 2.17 and Table 2.17, where a comparison is brought
out for all variations with the exponential modeling function. A typical comparison of the
performance of all modeling functions is shown in Fig 2.18 and Table 2.18, where one can
observe that the exponential modeling function shows better performance, with 1.6 training
error and 2.0 prediction error.

2.2.3.2 Results of modeling of Box Jenkins' gas furnace

From the results given in Table E, it can be observed that variation 5 with the sin
modeling function gives better results: the training error is 0.45 and the prediction
error is 0.1. For all variations the tanh and exponential modeling functions give poor
training as well as prediction; variation 5 has a training error of 0.54 and a prediction
error of 0.33. This can be observed in Fig 2.19 and Table 2.19, where a comparison of all
the modeling functions shows that the sin and polynomial modeling functions perform well.
From Fig 2.20 and Table 2.20 a comparison of all the variations can be observed. Variation
5 shows a good performance when compared to the rest of the variations.

2.2.4 Conclusions

In the present work, fuzzified ARMA is developed by applying fuzzy least
squares to the conventional ARMA technique. The effect of various modeling functions is
studied for different variations of fuzzy ARMA models. From the results discussed above
it can be concluded that fuzzifying the input leads to a poorer training but a better
prediction, i.e. variations 1 and 2 always show a better prediction though the training is
worse. The performance of each modeling function with every variation is almost
constant. With Box data the effect of the modeling function is not considerable, as the
variation in the data is very low.

2.3 Cluster Wise Regression
2.3.1 Introduction

Regression analysis is generally used in the model-fitting of observations. Heterogeneity
in the observed data is usually difficult to handle in a regression model. The
heterogeneity of observed samples arises from different clusters of observations. In the
present work, the observed samples of the system are divided into an optimum number of
clusters, then a separate model is developed for each cluster of observed samples. This
facilitates the handling of heterogeneity in the data. All the variations of fuzzy least
square regression developed in chapter 2 are used for developing the models for each
cluster. A comparison of performance of different clustering methods for different
variations is presented with two problems, estimation of life of converter lining and
modeling of Box Jenkins' gas furnace.

2.3.2 Formulation of cluster wise regression modeling

Formulation of cluster wise regression modeling comprises the following steps,
Step ( i ) : Preprocess or transform ( scale ) the data ( as described previously )

Step ( ii ) : Classify the data based on the independent variables
Step ( iii ) : Fuzzify the data ( as described previously, if necessary )

Step ( iv ) : Estimate the parameters of each model through fuzzy least square regression
modeling.

Step ( ii ) : Classification of Data

Let q be the set of observed (Xi, Yi), i = 1, ..., qn. Suppose these observations
are heterogeneous and come from different clusters of observations. The samples of
observation are divided into c clusters, considering each input as a feature. For
clustering the data, the following methods of clustering are employed.

1. Fuzzy c-means clustering

2. k-means clustering

3. Self organizing map (SOM) clustering

4. Adaptive Resonance Theory 2 (ART 2) clustering

5. Fuzzy Adaptive Resonance Theory (fuzzy ART) clustering.

Step ( iv ) : Estimation of Model Parameters

The data set of each cluster is fitted in the sense of best fit with respect to the
metric dm (as described in chapter 2), and the corresponding fuzzy least squares model is
represented as

$Y_i^k = a_0^k + \sum_{j=1}^{n} a_j^k\, x_{ij}$,  k = 1, 2, ..., c

where $a_0^k$ and $a_j^k$ are the unknown fuzzy parameters describing the model and
$x_{ij}$ is the j-th input of the i-th sample belonging to the k-th cluster. Different
models are developed with all the variations and different modeling functions as derived
in section 2.1.
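
The overall procedure (cluster the inputs, then fit one model per cluster) can be sketched for the crisp case as follows. This is a minimal sketch under stated assumptions and not the implementation of the present work: only a plain k-means step is shown among the clustering methods listed, each cluster gets a conventional least squares model rather than a fuzzy one, and all function names and demonstration data are hypothetical.

```python
import numpy as np

def nearest_center(X, centers):
    """Index of the closest cluster center for every sample (Euclidean distance)."""
    return np.linalg.norm(X[:, None, :] - centers[None, :, :], axis=2).argmin(axis=1)

def kmeans(X, c, n_iter=50, seed=0):
    """Plain Lloyd iteration, started from c distinct samples chosen at random."""
    rng = np.random.default_rng(seed)
    centers = X[rng.choice(len(X), size=c, replace=False)].copy()
    for _ in range(n_iter):
        labels = nearest_center(X, centers)
        # An empty cluster keeps its previous center.
        new_centers = np.array([
            X[labels == k].mean(axis=0) if np.any(labels == k) else centers[k]
            for k in range(c)
        ])
        if np.allclose(new_centers, centers):
            break
        centers = new_centers
    return centers

def fit_clusterwise(X, y, c):
    """Cluster on the inputs, then fit y = a0 + sum_j a_j x_j separately in each cluster."""
    centers = kmeans(X, c)
    labels = nearest_center(X, centers)
    models = []
    for k in range(c):
        Xk, yk = X[labels == k], y[labels == k]
        if len(Xk) == 0:
            models.append(np.zeros(X.shape[1] + 1))
            continue
        design = np.column_stack([np.ones(len(Xk)), Xk])
        coeffs, *_ = np.linalg.lstsq(design, yk, rcond=None)
        models.append(coeffs)
    return centers, models

def predict_clusterwise(centers, models, X):
    """Route each sample to its nearest cluster and apply that cluster's model."""
    labels = nearest_center(X, centers)
    return np.array([models[k][0] + x @ models[k][1:] for k, x in zip(labels, X)])

# Hypothetical heterogeneous data: two well separated groups with different linear laws.
X_demo = np.concatenate([np.linspace(0.0, 0.2, 10), np.linspace(0.8, 1.0, 10)]).reshape(-1, 1)
y_demo = np.where(X_demo[:, 0] < 0.5, 1.0 + 2.0 * X_demo[:, 0], 5.0 - 3.0 * X_demo[:, 0])
centers_demo, models_demo = fit_clusterwise(X_demo, y_demo, c=2)
fitted_demo = predict_clusterwise(centers_demo, models_demo, X_demo)
```

A single linear model cannot reproduce both groups in this example, whereas the two cluster-wise models fit each group exactly, which is the motivation for the cluster wise approach when the data is heterogeneous.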

2.3.3 Results and discussion

The models developed for cluster wise regression are applied to the problem of estimation
of life of converter lining. In all the problems the input data is scaled between 0 and
1. For the SOM the neighborhood size is varied between 0 and (number of clusters - 1), the
period is kept as 5 and the maximum limit of iterations is kept as 75. The factor $\alpha$
is kept as 0.15. For ART2 the vigilance factor is varied between 0.9 and 4.6 and the
optimum value is found to be 3.2. For fuzzy ART2 the vigilance factor is varied between
0.5 and 0.99 and the optimum value is found to be 0.76. The factors $\alpha$ and $\beta$
are kept as 0.18 and 0.5 as per the specification in [14].

2.3.3.1 Results of estimation of life of converter lining problem

i. Results of estimation of life of converter lining problem ( ICA)

From the results given in Table A, one can observe that the training error reduces
as the number of clusters increases, but the prediction error shows no consistent relation
with the number of clusters. The behavior with an increasing number of clusters differs
between clustering methods and between variations. For example, with fuzzy c-means
clustering in variation 1, the better performance is with five clusters, i.e. the training
error is 1.58 and the prediction error is 5.70. In variation 4, it is with two clusters:
the training error is 2.43 and the prediction error is 2.97. It can also be seen that with
the SOM clustering method variation 4 using the exponential modeling function has the best
performance, with a training error of 2.4 and a prediction error of 0.29.

A comparison of performance of modeling functions can be studied from Table
2.23 and Fig 2.23 for variation 5 with three clusters using the SOM clustering method.



comparison of all variations in clw.regr* on estimation of life of
converter lining problem (ICA)


































effect of no. of clusters in clw.regr* on estimation of life of converter
lining problem (ICA)












comparison of different modeling functions in clw.regr* on estimation of life
of converter lining problem (ICA)




















* clw.regr means cluster wise regression

























































comparison of all variations in clw.regr* model on problem of estimation of
life of converter lining (mean, R&D)










































From the graph it can be deduced that exponential modeling shows good performance for
variation 5 when the data is divided into three clusters. The effect of the number of
clusters for variation 5 and SOM clustering with the exponential modeling function is
shown in Fig 2.21 and Table 2.21; dividing the data into five clusters yields good
training but a poor prediction. A comparison of all variations is brought out for the
exponential modeling function with five clusters and the SOM clustering method in Table
2.21 and the corresponding figure. It can be observed that variation 4 has the best
performance, with training error 2.4 and prediction error 0.29.

ii. Results of estimation of life of converter lining problem ( mean, R&D)

For the RD problem the results are given in Table C. The same trend can be observed in
this problem as in the case of ICA. The best performance is given by variation 5 with
three clusters, using SOM to cluster and the tanh modeling function: the training error is
0.21 and the prediction error is 3.1. Variation four also shows a good performance, with
training error 2.41 and prediction error 1.20. The modeling function used for this case is
… and the clustering method used is SOM with three clusters. The same trend of the
fuzzified model having higher training error and lower prediction error is seen here. For
this problem the effect of the number of clusters is shown in Fig 2.25 and Table 2.25.
From the graph it can be deduced that the division of data into three clusters gives a
better performance for the model. From Fig 2.24 and Table 2.24 the effect of modeling
functions can be observed. It can be deduced that the model with the tanh modeling
function has a better performance. A comparison of all the variations is shown in Fig 2.26
and Table 2.26.
Results of estimation of life of converter lining problem ( median, R&D) 

    The results are given in Table B. The best performance is shown by variation 5
using the tanh modeling function and K-means clustering with three clusters:
the training error is 0.2 and the prediction error is 3.7. Variation 2, using the polynomial
modeling function and K-means clustering with two clusters, has poorer training but
good prediction: the training error is 3.99 whereas the prediction error is 1.84. The effect
of the number of clusters is shown in Fig 2.27 and Table 2.27. From the graph one can
observe that the division of the data into three clusters gives good results for variation
five. The effect of modeling functions in variation 5 with three clusters using K-means
clustering is given in Fig 2.28 and Table 2.28; the tanh and sin modeling functions give good
results. A comparison of all the variations is shown in Fig 2.29 and Table 2.29.
Finally a comparison of all the clustering methods is given in Fig 2.30 and Table 2.30. From
the table it can be seen that the K-means clustering method shows the best performance
of all the clustering methods.

*clw.regr means cluster wise regression model

mn error = mean error, ms error = mean squared error, rms error = root mean squared error, error std = standard deviation of error,
max error = maximum error, min error = minimum error
shaded row represents the best performance

[Figures and tables not reproduced: plots of mean error (in %) and tables of training and prediction statistics against the number of clusters, including the comparison of different modeling functions in the clw.regr model on the gas furnace problem.]

Results of estimation of life of converter lining problem (PCA) 

    From the results given in Table D, it can be observed that variation 3 with the tanh
modeling function, using K-means clustering with three clusters, has a training error of
2.39 and a prediction error of 0.33. Variation 1 with the polynomial
modeling function and fuzzy C-means clustering with five clusters has a training error of
….65 and a prediction error of 1.51. With SOM clustering, also with five
clusters, and the exponential modeling function, the training error is 2.38 and the prediction
error is 1.39. The effect of the number of clusters on the PCA problem is shown in
Fig 2.31 and Table 2.31. From the graph one can observe that the division of the data into
four clusters gives good results for variation 3. The effect of modeling functions in
variation 3 with four clusters using K-means clustering is given in Fig 2.32 and Table
2.32. A comparison of all the variations is shown in Fig 2.33 and Table 2.33.

2.3.3.5 Results of modeling of Box Jenkins’ gas furnace problem

    The results are given in Table E. From the results it can be observed that variation
… using the simple polynomial function with fuzzy C-means clustering with four clusters
shows better performance: the training error is 0.47 and the prediction error is 0.44. The
same trend as earlier can be observed with this problem also. The effect of the number
of clusters is shown in Fig 2.34 and Table 2.34. From the graph one can observe that
division of the data into five clusters gives good results for variation 5 using the exponential
modeling function. The effect of modeling functions in variation 5 with five clusters
using K-means clustering is given in Fig 2.35 and Table 2.35. A comparison of all the
variations is shown in Fig 2.36 and Table 2.36.

2.3.4 Conclusions

    In the present work, cluster wise fuzzy regression analysis is presented. The
procedure presented generalizes the methods of Diamond [1]. As the classification
depends on the underlying principles of each clustering algorithm, the results depend heavily
on the chosen clustering algorithm [2]. Based on numerical experiments, the K-means
clustering algorithm is recommended. The modeling functions have a considerable effect
upon the performance of each model. The choice of modeling function has to be made
based on the input data pattern. For the sin, tanh, and exponential functions the scaling plays
an important role. For a system having low complexity like the Box data, the clustering does
not show an impact upon the performance of the model. The SOM clustering method, though it
shows good performance, is very sensitive in its ability to classify the data to variations in α.
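
The cluster wise regression idea summarized in these conclusions can be illustrated with a minimal sketch. It is not the implementation used in this work: it assumes crisp (non-fuzzy) data, plain K-means for the division of the input space, and a first-order polynomial fitted by ordinary least squares in each cluster. The function names `kmeans`, `fit_clusterwise` and `predict_clusterwise` are hypothetical.

```python
import numpy as np

def kmeans(X, k, iters=100, seed=0):
    """Plain k-means on the rows of X; returns (centers, labels)."""
    rng = np.random.default_rng(seed)
    centers = X[rng.choice(len(X), size=k, replace=False)].astype(float)
    for _ in range(iters):
        dist = np.linalg.norm(X[:, None, :] - centers[None, :, :], axis=2)
        labels = dist.argmin(axis=1)
        new_centers = np.array([
            X[labels == j].mean(axis=0) if np.any(labels == j) else centers[j]
            for j in range(k)
        ])
        if np.allclose(new_centers, centers):
            break
        centers = new_centers
    # Final assignment against the returned centers.
    dist = np.linalg.norm(X[:, None, :] - centers[None, :, :], axis=2)
    return centers, dist.argmin(axis=1)

def fit_clusterwise(X, y, k):
    """Fit one linear model (intercept + inputs) per k-means cluster."""
    centers, labels = kmeans(X, k)
    A = np.hstack([np.ones((len(X), 1)), X])
    coefs = np.zeros((k, A.shape[1]))
    for j in range(k):
        mask = labels == j
        if mask.any():
            # lstsq returns the minimum-norm solution for small clusters.
            coefs[j] = np.linalg.lstsq(A[mask], y[mask], rcond=None)[0]
    return centers, coefs

def predict_clusterwise(X, centers, coefs):
    """Predict with the model of the nearest cluster center."""
    dist = np.linalg.norm(X[:, None, :] - centers[None, :, :], axis=2)
    labels = dist.argmin(axis=1)
    A = np.hstack([np.ones((len(X), 1)), X])
    return np.einsum("ij,ij->i", A, coefs[labels])
```

On heterogeneous data made of two different linear regimes, a single regression line fits neither regime, while the cluster wise model can recover both; this is the effect of the division of data discussed in the results.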



Chapter 3 

3.1 Sugeno-type fuzzy identification method 

3.1.1 Introduction

    The Sugeno-type fuzzy identification method [ 4 ] is a mathematical tool to build a
fuzzy model of a system where fuzzy implications and reasoning are used. The premise of
an implication is the description of a fuzzy subspace of inputs and its consequence is a
linear input-output relation. In the present work, the Sugeno-type fuzzy identification is
simply implemented for the test problems, and the results are discussed.

3.1.2 Implementation of the method

    Implementation of the Sugeno-type fuzzy identification involves the following
steps.

Step ( i ) : Preprocess or scale the data

Step ( ii ) : At layer one assume an input variable as premise variable and divide the
input space.

Step ( iii ) : Evaluate the consequent parameters through recursive least square
regression, for every premise.

Step ( iv ) : Premise parameters are readjusted iteratively by the complex algorithm

Step ( v ) : Evaluate the performance index for the premise variable

Step ( vi ) : Keep the variable giving the least performance index as premise variable for
layer one and proceed to the next layer.

Step ( vii ) : Repeat from step ( ii ) to step ( vi ) until the required performance is
achieved.

Details of each step are given in [ 4 ].
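
Steps (ii), (iii) and (v) above can be sketched for a single layer as follows. This is only an illustrative sketch under stated assumptions, not the procedure of [ 4 ]: the premise variable is divided into two fuzzy subspaces ("small" and "big") with linear membership functions on a fixed range, the consequent parameters are estimated by batch least squares rather than recursively, the premise parameters `lo` and `hi` are not readjusted, and the performance index is taken to be the mean squared error. All function names are hypothetical.

```python
import numpy as np

def memberships(x, lo, hi):
    """Two complementary fuzzy sets 'small' and 'big' on [lo, hi]."""
    big = np.clip((x - lo) / (hi - lo), 0.0, 1.0)
    return np.stack([1.0 - big, big], axis=1)       # shape (N, 2), rows sum to 1

def design_matrix(X, premise_col, lo, hi):
    """Each rule contributes weight * [1, u_1, ..., u_r] to the regressors."""
    w = memberships(X[:, premise_col], lo, hi)
    A = np.hstack([np.ones((len(X), 1)), X])
    return np.hstack([w[:, [i]] * A for i in range(w.shape[1])])

def fit_sugeno(X, y, premise_col, lo, hi):
    """Least-squares estimate of the consequent parameters of both rules."""
    Phi = design_matrix(X, premise_col, lo, hi)
    theta, *_ = np.linalg.lstsq(Phi, y, rcond=None)
    return theta

def predict_sugeno(X, theta, premise_col, lo, hi):
    """Model output as the membership-weighted sum of the linear consequents."""
    return design_matrix(X, premise_col, lo, hi) @ theta

def performance_index(y, y_hat):
    """Mean squared error, used here as the performance index (assumption)."""
    return float(np.mean((y - y_hat) ** 2))
```

In a full implementation, step (vi) would call `fit_sugeno` once for each candidate premise variable and keep the one with the smallest `performance_index`.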

3.1.3 Results and discussion

    For all the problems the data are scaled between 0 and 1 using the previously
mentioned scaling technique. From the results given in Table 3.1 and Fig 3.1 it can be
seen that the training is better but the prediction is worse; for example, in the case of
estimation of life of converter lining (ICA), the training error is 2.1 whereas the prediction
error is 3.8. In this problem the maximum number of layers is kept as 2. For the case of the Box
data the training as well as the predictions are good. The results for the Box data match
the results given in [3].


3.2 Orthogonal parameter estimation technique
3.2.1 Introduction

    In orthogonal parameter estimation [ 5 ], the premise of the model is first determined
using a fuzzy discretization technique by constructing reference fuzzy sets. This amounts
to the partition of the input space, as has been done in the previous methods. The number
of reference fuzzy sets determines the number of rules and the number of linear equations
in the consequent part of the model. The parameters of these linear equations are then
estimated using an orthogonal estimator. The present work studies the effect of the clustering
method upon the performance of each model.

3.2.2 Formulation of Ortho-Clustering Technique

    Steps involved in modeling the system through the ortho-clustering technique are as
follows.

Step ( i ) : Preprocess or transform ( scale ) the data

Step ( ii ) : Classify the input space for premise identification

Step ( iii ) : Formulate [ φ ] and [ y ] from the identified premises

Step ( iv ) : Estimate model parameters through orthogonal least square regression

Details of each step are given below.

Step ( ii ) Classification of Input Space

    The premise model of the system is determined using the fuzzy discretization
technique by constructing the reference fuzzy sets through a suitable clustering algorithm.
Clustering methods used for dividing the input space are

1. Fuzzy c-means clustering

2. k-means clustering

3. Self organizing map (SOM) clustering

4. Adaptive Resonance Theory 2 (ART 2) clustering

5. Fuzzy Adaptive Resonance Theory (fuzzy ART) clustering.
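
As an illustration of how a clustering algorithm can supply reference fuzzy sets, the following minimal sketch implements the standard fuzzy c-means iteration. The fuzzifier `m = 2`, the random initialization, the tolerance and the function name are assumptions made for this example; the settings actually used in this work are not stated in this section.

```python
import numpy as np

def fuzzy_c_means(X, c, m=2.0, iters=100, tol=1e-6, seed=0):
    """Standard fuzzy c-means; returns (centers, U) with U[k, i] the
    membership grade of sample k in cluster i (each row of U sums to 1)."""
    rng = np.random.default_rng(seed)
    U = rng.random((len(X), c))
    U /= U.sum(axis=1, keepdims=True)
    for _ in range(iters):
        W = U ** m
        centers = (W.T @ X) / W.sum(axis=0)[:, None]
        dist = np.linalg.norm(X[:, None, :] - centers[None, :, :], axis=2)
        dist = np.maximum(dist, 1e-12)               # avoid division by zero
        inv = dist ** (-2.0 / (m - 1.0))
        U_new = inv / inv.sum(axis=1, keepdims=True)
        change = np.abs(U_new - U).max()
        U = U_new
        if change < tol:
            break
    return centers, U
```

Each column of `U` can be read as one reference fuzzy set over the data, so the number of clusters `c` fixes the number of rules of the model.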

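Step ( iii ) Formulation of [ φ ], [ y ]

    Given an input $\{u_1, \ldots, u_r\}$, the discretized form of an input variable $u_j$ is
expressed as $[\mu_{j1}, \ldots, \mu_{jl}]$, where $l$ denotes the number of reference fuzzy sets
as well as the number of rules constituting the model [5]. Then the total output of the model
is inferred by taking the weighted average of the local outputs $(y^1, \ldots, y^l)$:

$$y_k = \varphi_k \theta, \qquad \varphi_k = [\, v^1, \ldots, v^l,\; v^1 u_1, \ldots, v^l u_1,\; \ldots,\; v^1 u_r, \ldots, v^l u_r \,]$$

In the above equation [ θ ] is the vector of parameters of the model and $v^i$ can be
calculated as

$$v^i = \frac{\mu^i}{\sum_{j=1}^{l} \mu^j}$$

where $\mu^i$ is the membership grade of the input in the $i$th reference fuzzy set.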

Step ( iv ) Estimation of parameters of the model

    To determine which terms to include in the above equation and then estimate their
parameters, the stepwise regression procedure, along with the orthogonal least-squares
algorithm [ 13 ],[ 12 ], is used. The basic idea of this algorithm is to transform the following
equation

$$y_k = \varphi_k \theta = \sum_{i=1}^{(r+1)l} \theta_i \varphi_{ik}$$

into the equivalent orthogonal equation

$$y_k = \sum_{i=1}^{(r+1)l} g_i w_{ik}$$

where the $w_{ik}$'s are orthogonal to one another, with

$$w_{1k} = \varphi_{1k}, \qquad w_{jk} = \varphi_{jk} - \sum_{i=1}^{j-1} \alpha_{ij} w_{ik}, \qquad \alpha_{ij} = \frac{\sum_k w_{ik} \varphi_{jk}}{\sum_k w_{ik}^2}$$

(all sums over $k$ run over the data points). The estimates of the coefficients $g_i$ are given by

$$\hat{g}_i = \frac{\sum_k w_{ik} y_k}{\sum_k w_{ik}^2}$$

The coefficients of the original equation can easily be obtained according to the formulas

$$\theta_{(r+1)l} = g_{(r+1)l}, \qquad \theta_i = g_i - \sum_{j=i+1}^{(r+1)l} \alpha_{ij} \theta_j, \quad i = (r+1)l-1,\ (r+1)l-2,\ \ldots,\ 1.$$

Define the error reduction ratio due to the $i$th term as

$$[err]_i = \frac{g_i^2 \sum_k w_{ik}^2}{\sum_k y_k^2}$$

Thus the significant terms can be chosen if their error reduction ratio is greater than some
threshold value. The procedure for transforming the $\varphi_{ik}$'s to $w_{ik}$'s is given below
(Gram-Schmidt orthogonalization [ 13 ]).

In the first step, all the $\varphi_{ik}$, $i = 1, 2, \ldots, (r+1)l$, are considered as possible
candidates for $w_{1k}$. For $i = 1, 2, \ldots, (r+1)l$, calculate

$$w_{1k}^{(i)} = \varphi_{ik}, \qquad g_1^{(i)} = \frac{\sum_k w_{1k}^{(i)} y_k}{\sum_k (w_{1k}^{(i)})^2}, \qquad [err]_1^{(i)} = \frac{(g_1^{(i)})^2 \sum_k (w_{1k}^{(i)})^2}{\sum_k y_k^2}$$

Find the maximum $[err]_1^{(p)} = \max_i [err]_1^{(i)}$; the first term
$w_{1k} = w_{1k}^{(p)} = \varphi_{pk}$ is selected with $g_1 = g_1^{(p)}$ and $[err]_1 = [err]_1^{(p)}$.

In the second step, all the $\varphi_{ik}$, $i = 1, 2, \ldots, (r+1)l$, $i \neq p$, are considered as
possible candidates for $w_{2k}$. For $i = 1, 2, \ldots, (r+1)l$, $i \neq p$, calculate

$$\alpha_{12}^{(i)} = \frac{\sum_k w_{1k} \varphi_{ik}}{\sum_k w_{1k}^2}, \qquad w_{2k}^{(i)} = \varphi_{ik} - \alpha_{12}^{(i)} w_{1k}, \qquad g_2^{(i)} = \frac{\sum_k w_{2k}^{(i)} y_k}{\sum_k (w_{2k}^{(i)})^2}, \qquad [err]_2^{(i)} = \frac{(g_2^{(i)})^2 \sum_k (w_{2k}^{(i)})^2}{\sum_k y_k^2}$$

Find the maximum $[err]_2^{(q)} = \max_i [err]_2^{(i)}$; the second term
$w_{2k} = w_{2k}^{(q)} = \varphi_{qk} - \alpha_{12}^{(q)} w_{1k}$ is selected with
$\alpha_{12} = \alpha_{12}^{(q)}$, $g_2 = g_2^{(q)}$ and $[err]_2 = [err]_2^{(q)}$.

The same procedure is repeated and terminated at the $M_s$th step when

$$1 - \sum_{i=1}^{M_s} [err]_i < \rho$$

where $\rho$ is a chosen tolerance. This gives rise to a subset model containing $M_s$ significant
terms.

[Tables not reproduced: effect of number of clusters in ortho-clustering model on estimation of life of converter lining problem (ICA), and the corresponding table for the (mean, R&D) problem.]
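
The forward selection with the error reduction ratio described in step (iv) can be sketched as below. This is an illustrative sketch of the generic forward orthogonal least-squares procedure, not the code used in this work; the function name, the argument `rho` (the tolerance on the unexplained fraction of the output energy) and the small threshold used to skip linearly dependent candidates are assumptions of the example.

```python
import numpy as np

def ols_forward_selection(Phi, y, rho=1e-3):
    """Forward orthogonal least squares with error reduction ratio (ERR).

    Phi : (N, M) matrix of candidate regressors, y : (N,) output (not all zero).
    Returns (selected column indices, their coefficients theta, ERR values).
    """
    n_terms = Phi.shape[1]
    yy = float(y @ y)
    selected, W, g, err = [], [], [], []
    A = np.eye(n_terms)                  # unit upper-triangular matrix of alpha_ij
    while len(selected) < n_terms and 1.0 - sum(err) >= rho:
        best = None
        for i in range(n_terms):
            if i in selected:
                continue
            # Gram-Schmidt: remove the components along already chosen w's.
            alphas = [float(w @ Phi[:, i]) / float(w @ w) for w in W]
            w_i = Phi[:, i] - sum(a * w for a, w in zip(alphas, W))
            ww = float(w_i @ w_i)
            if ww < 1e-12:               # candidate is numerically dependent
                continue
            g_i = float(w_i @ y) / ww
            err_i = g_i ** 2 * ww / yy
            if best is None or err_i > best[0]:
                best = (err_i, i, w_i, g_i, alphas)
        if best is None:
            break
        err_i, i, w_i, g_i, alphas = best
        s = len(selected)
        if s:
            A[:s, s] = alphas
        selected.append(i)
        W.append(w_i)
        g.append(g_i)
        err.append(err_i)
    s = len(selected)
    # Back-substitution: A theta = g recovers the original coefficients.
    theta = np.linalg.solve(A[:s, :s], np.array(g)) if s else np.zeros(0)
    return selected, theta, err
```

A larger `rho` stops the selection earlier and keeps fewer terms, which is one way to read the trade-off between training and prediction error reported for the stopping criterion.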

>.2.3 Results and discussion 

    As described in chapter 3, various models are developed using different clustering
methods. For each clustering method the stopping criterion, epsilon, is varied between
0.001 and 0.1. The results for epsilon 0.001, 0.005, 0.01 and 0.1 are noted. The number of
clusters formed is also varied between one and five. The parameters for the SOM, ART2 and
fuzzy ART are as follows. As the data are scaled between zero and one for every problem, the
parameters remain the same. For the SOM the neighborhood size is varied between 0 and
(number of clusters - 1), the period is kept as 5 and the maximum limit of iterations is kept
as 75. The factor α is kept as 0.15. For ART2 the vigilance factor is varied between 0.9 and
0.6 and the optimum value is found to be 0.8. For fuzzy ART the vigilance factor is varied
between 0.5 and 0.99 and the optimum value is found to be 0.6. The factors α and β are kept as
0.18 and 0.5 as per the specification in [ 14 ].

3.2.3.1 Results of estimation of converter lining problem
i. Results of estimation of converter lining problem (ICA)

    From the results tabulated in Table A, one can observe that the division of the data set
into two clusters using the k-means clustering technique and keeping the stopping criterion
epsilon = 0.001 gives the best performance, with a training error of 0.64 and a prediction error of
1.04. The effect of the division of data can be observed in Fig 3.2 and Table 3.2: the
training error decreases as the number of clusters increases. But from the statistics of the
prediction, it can be deduced that the division of data into two clusters gives good
performance; increasing the number of clusters beyond two results in worse performance. The
effect of the stopping criterion can be observed in Fig 3.3 and Table 3.3: the training error
increases with increase of epsilon, but the prediction is better with the epsilon value equal
to 0.1, and the overall performance is better with 0.001.
ii. Results of estimation of converter lining problem (mean, R&D)

    The results are given in Table C. From the results it can be seen that having
two clusters using the k-means clustering method and keeping the stopping criterion as 0.005
gives good performance, i.e. the training error is 1.29 and the prediction error is 6.08. From
Fig 3.4 and Table 3.4 it can be deduced that the division of data into two clusters gives good
performance; increasing the number of clusters beyond two results in worse performance. The
comparison of the performance of different clustering methods is brought out in Fig 3.5 and
Table 3.5. Fuzzy … same performance, but SOM has the worst performance. The effect of the
stopping criterion is also different, as one can observe from Fig 3.6 and Table 3.6. The training
error increases as the epsilon value increases, but the prediction is good with epsilon 0.005.

[Table not reproduced: effect of number of clusters in ortho-clustering model on gas furnace problem.]
iii. Results of estimation of converter lining problem (median, R&D)

    From the results given in Table B, it can be observed that fuzzy c-means clustering
shows good performance with two clusters and epsilon kept at 0.001: the training
error is 1.36 and the prediction error is 5.67. SOM shows the worst performance, with
training error 3.5 and prediction error 34.7. From Fig 3.7 and Table 3.7 it can be
deduced that the division of data into two clusters gives good performance; increasing the
number of clusters beyond two results in worse performance. The effect of the stopping
criterion can be observed from Fig 3.8 and Table 3.8. The training error decreases as the
epsilon value increases, but the prediction is good with epsilon 0.001.
iv. Results of estimation of converter lining problem (PCA)

    From the results given in Table D, it can be observed that the best performance is
given by fuzzy c-means clustering with epsilon value equal to 0.005 and the data divided
into 4 clusters: the training error is 2.8 and the prediction error is 1.75. The effect of
division of data into different clusters has changed for this problem. From Fig 3.9 and
Table 3.9 it can be deduced that the division of data into four clusters has a better
performance for prediction, i.e. 2.8 and 1.75. From Fig 3.10 and Table 3.10, it can be observed that
with epsilon error there is overfitting of the model, which causes the prediction to be
worse. The model with epsilon value equal to 0.005 has better performance.

3.2.2.5 Results of modeling of Box Jenkins’ gas furnace problem 

    From the results given in Table E, it can be observed that the best performance is
given by fuzzy c-means clustering with epsilon value equal to 0.001 and the data divided
into 1 cluster: the training error is 0.55 and the prediction error is 0.52. The effect of the number
of clusters is shown in Fig 3.11 and Table 3.11, and the effect of the stopping criterion can
be observed in Fig 3.12 and Table 3.12.

3.3 Conclusions

    This work brings out a comparison of the performance of various clustering
algorithms and also looks at the effect of the stopping criterion, epsilon, in determining the
significant terms of the model. Among the clustering methods used, K-means shows better
performance. Division of data into two clusters gives better results. From the results of all the
problems one can deduce that by increasing the epsilon value the training gets poorer,
but the prediction shows inconsistent behavior. In the case of the PCA problem the
prediction is good with epsilon value 0.005, but for all the problems the results are good with
epsilon value 0.001. Also one can observe that by increasing the number of clusters the
training gets better, but the prediction is good with two clusters for the estimation of life of
converter lining problem. In the Box problem the results are good with a single cluster, as the
data has less complexity embedded in it.
comparison of different models on the problem of estimation of life of converter lining (ICA)

Table 4.1 comparison of different models on estimation of life of converter lining problem (ICA)

mn error = mean error, ms error = mean squared error, rms error = root mean squared error, error std = standard deviation of error,
max error = maximum error, min error = minimum error, clw.regr = cluster wise regression, regr = regression, conv. = conventional
shaded row represents the element corresponding to the best performance

Chapter 4 


4.1 Results and discussion 

In the present section the performance of all the models is evaluated by selecting 
the best results presented in the previous chapters. 
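
The six error statistics listed in the table legends (mn, ms, rms error, error std, max and min error) can be computed as in the sketch below. The exact error definition used for the tables is not stated in this section; the sketch assumes the absolute relative error in percent, which matches the "mean error (in %)" axis of the figures, and uses the population standard deviation. The function name is hypothetical.

```python
import numpy as np

def error_statistics(y_true, y_pred):
    """Summary statistics of the percentage absolute error (assumed definition)."""
    y_true = np.asarray(y_true, dtype=float)
    y_pred = np.asarray(y_pred, dtype=float)
    err = 100.0 * np.abs(y_pred - y_true) / np.abs(y_true)
    ms = float(np.mean(err ** 2))
    return {
        "mn error": float(err.mean()),
        "ms error": ms,
        "rms error": float(np.sqrt(ms)),
        "error std": float(err.std()),
        "max error": float(err.max()),
        "min error": float(err.min()),
    }
```

Computing the same statistics separately on the training and the prediction samples gives the two halves of each comparison table.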

4.2.1 Results of estimation of life of converter lining problem (ICA)

    From Table 4.1 and Fig 4.1 it can be observed that cluster wise regression with the
fuzzy model shows the best performance: the training error is 2.4 and the prediction error is
0.29. The conventional regression gives the worst results, with training error 2.53 and
prediction error 5.18. From Table 4.1 and Fig 4.1 it can be deduced that in all the
methods the fuzzified model has better performance when compared to the conventional
one. In simple regression the conventional model has a training error of 2.53 and a
prediction error of 5.18.

4.2.2 Results of estimation of life of converter lining problem (median, R&D)

    From the results given in Fig 4.2 and Table 4.2 it can be observed that the division
of data into different clusters has a predominant effect upon the performance of the
model. In simple conventional regression, the training and prediction errors are 2.19 and
33.665, whereas in the cluster wise conventional regression the training and prediction
errors are 0.2 and 3.7 respectively. Also it can be seen that the ARMA method is able to
estimate better than the simple regression. The conventional ARMA has a training error
of 0.1 and a prediction error of 5.22, which is quite low when compared to the simple
conventional regression. The effect of fuzzification is very low for this problem, as it can be
observed that the training and prediction errors of fuzzy cluster wise regression (3.29 and
1.04 respectively) are not better than those of the conventional one.

4.2.3 Results of estimation of life converter lining problem (mean, R&D) 

    The results are given in Table 4.3. From Fig 4.3 and Table 4.3 it can be seen that
fuzzification gives much better prediction though the training is poorer: the conventional
cluster wise regression has a training error of 0.21 and a prediction error of 3.1, whereas cluster
wise fuzzy regression has a training error of 2.42 and a prediction error of 1.2. In this
problem the ARMA method of modeling shows the worst performance, with a training error of
0.95 and a prediction error of 45.63.

Table 4.3 comparison of different models on estimation of life of converter lining problem (mean, R&D)

4.2.4 Results of estimation of life of converter lining problem (PCA)

    From the results given in Table 4.4 and Fig 4.4 it can be deduced that fuzzification
has much effect upon the performance of the model. The training and prediction
errors of conventional cluster wise regression are 0.11 and 2.78, whereas the fuzzy cluster
wise regression training and prediction errors are 2.4 and 0.33. In this case also the
division of data shows better performance than the simple regression and ARMA. The
training and prediction errors of conventional regression are 1.08 and 36.85 respectively,
whereas the training and prediction errors of conventional cluster wise regression are
0.11 and 2.78. The ARMA method of modeling shows much better performance than the
simple regression. The fuzzified ARMA has a training error of 1.6 and a prediction error of
2.00, whereas the fuzzy simple regression has a training error of 2.43 and a prediction error
of 33.57.

4.2.5 Results of modeling of Box Jenkins’ gas furnace problem

    The results are given in Table 4.5 and a comparison is shown in Fig 4.5. From the
results it can be deduced that the fuzzification of data does not yield good results. The
training and prediction errors of simple conventional regression are 0.48 and 0.49
respectively. But for the fuzzy simple regression the training and prediction errors are
1.08 and 0.47 respectively. The division of data yields good results for conventional
regression, but for the fuzzified data the clustering makes the performance worse. The training
and prediction errors of cluster wise conventional regression are 0.47 and 0.21, whereas
for the simple conventional regression they are 0.48 and 0.49. The training and prediction
errors of cluster wise fuzzy regression are 0.88 and 0.62, whereas for simple fuzzy
regression these are 1.08 and 0.47.



Chapter 5 


5.1 General conclusions

    This work mainly emphasizes comparing the performances of different modeling
techniques. A new approach to the fuzzy least square method is proposed for multi input
systems. The effect of modeling functions upon the performance of the model is studied.
Fuzzified models of ARMA and cluster wise regression are developed by applying the
fuzzified least square regression to their conventional models. The effect of different
clustering methods on the performance of the orthogonal parameter estimator and the cluster wise
regression model is evaluated.

    From the results of the example problems it can be concluded that the fuzzification
of the data decreases the precision. For the fuzzified model the prediction is better
than that of the conventional models, though the training is poorer. The division of data results in
betterment of the modeling. ARMA proves to be a much better modeling technique than the
simple least square regression. Fuzzification of the ARMA yields better prediction
compared to the conventional ARMA for a system having inherent imprecision. For a
simple system like Box Jenkins’ gas furnace the fuzzification does not yield
good results. From the results of orthogonal clustering it can be concluded that k-means
clustering is more efficient in dividing the input space.
5.2 Scope for future

1. The method of implementation for simple fuzzy regression with fuzzified input
variables and fuzzy model parameters has to be studied.

2. Effect of scaling of the data on the performance of the modeling function has to be
evaluated.

3. Effect of different fuzzification and defuzzification methods on the performance of
the model has to be evaluated.

4. An information criterion has to be developed for determining the optimum number of
clusters for better classification of the data.

5. Feasibility of extending the existing information criteria such as AIC and BIC [16],
used for determining the optimum order in conventional ARMA, to fuzzy ARMA has to be
evaluated.
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Appendix 

A.1 L_R FUZZY REAL NUMBERS
Fuzzy Real Numbers

    Let R be the set of real numbers and $n \in R$ a given real number. From the real
number n one can construct a fuzzy real number N as a fuzzy set that covers the number n.
When the fuzziness of N is removed it reduces exactly to n [2].

Definition A.1: N is a fuzzy real number if and only if: (i) it is a fuzzy subset of
the set R of real numbers, (ii) its membership function $\mu_N(x)$ has the following properties

a. $\mu_N(x)$ is a continuous function

b. $\forall x \in (-\infty, c]$, $\mu_N(x) = 0$

c. $\mu_N(x)$ is strictly increasing in $[c, a]$

d. $\forall x \in [a, b]$, $\mu_N(x) = 1$

e. $\mu_N(x)$ is strictly decreasing in $[b, d]$

f. $\forall x \in [d, +\infty)$, $\mu_N(x) = 0$

Let y(x) be a real function $y(x): R \to [0, 1]$ that possesses the properties of $\mu_N(x)$ of
Definition A.1, and can be considered the membership function of a fuzzy real number.
Definition A.2: Let L(x) and R(x) be two functions satisfying the conditions of Def. A.1.
Then an L_R fuzzy number M around the classical number m is defined as a fuzzy number
M with membership function:

$$\mu_M(x) = \begin{cases} L\big((m - x)/\alpha\big), & x \le m \\ R\big((x - m)/\beta\big), & x \ge m \end{cases}$$

where m is the modal value and $\alpha$, $\beta$ are spread parameters. Since the L_R fuzzy
number is fully determined by the triad of parameters m, $\alpha$ and $\beta$, it is symbolised by:
$M = (m, \alpha, \beta)$.
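Theorem A.1 

Let $M = (m, \alpha, \beta)$ and $N = (n, \gamma, \delta)$ be two L-R fuzzy numbers. Then one can show [2] the 
following: 

1. Fuzzy addition 

$M + N = (m, \alpha, \beta) + (n, \gamma, \delta) = (m + n, \alpha + \gamma, \beta + \delta)$ 

2. Fuzzy opposite 

$-M = -(m, \alpha, \beta) = (-m, \beta, \alpha)$, an R-L number 

3. Fuzzy subtraction 

Subtraction is meaningful only between L-R and R-L, L-L and L-L, and R-R and R-R 
numbers (not between R-L and R-L numbers): 

$M - N = M + (-N)$ 

4. Fuzzy inverse 

$1/M = (1/m, \beta/m^2, \alpha/m^2)$, an R-L number 

5. Fuzzy multiplication 

Three cases are distinguished: 

Case A: If $m > 0$ and $n > 0$, then 

$MN = (m, \alpha, \beta)(n, \gamma, \delta) = (mn, m\gamma + n\alpha, m\delta + n\beta)$ 

Case B: If $m < 0$ and $n > 0$, then 

$MN = (m, \alpha, \beta)(n, \gamma, \delta) = (mn, n\alpha - m\delta, n\beta - m\gamma) = -[(-(m, \alpha, \beta))(n, \gamma, \delta)]$ 

Case C: If $m < 0$ and $n < 0$, then 

$MN = (m, \alpha, \beta)(n, \gamma, \delta) = (mn, -n\beta - m\delta, -n\alpha - m\gamma) = (-(m, \alpha, \beta))(-(n, \gamma, \delta))$ 

6. Fuzzy division 

$M/N = M \cdot (1/N) = (m, \alpha, \beta)/(n, \gamma, \delta) = (m/n, (m\delta + n\alpha)/n^2, (m\gamma + n\beta)/n^2)$ 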
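The operations of Theorem A.1 can be sketched as a small class. This is only an illustrative sketch: the class name `LRNumber` is hypothetical, the L-R versus R-L type bookkeeping is ignored (only the triad is tracked), and the case m > 0, n < 0, which the theorem does not list, is handled here by assuming commutativity.

```python
from dataclasses import dataclass


@dataclass(frozen=True)
class LRNumber:
    m: float      # modal value
    alpha: float  # left spread
    beta: float   # right spread

    def __add__(self, other):
        return LRNumber(self.m + other.m,
                        self.alpha + other.alpha,
                        self.beta + other.beta)

    def __neg__(self):
        # The opposite swaps the left and right spreads.
        return LRNumber(-self.m, self.beta, self.alpha)

    def __sub__(self, other):
        return self + (-other)

    def inverse(self):
        # Defined for m != 0; the spreads swap and scale by 1 / m^2.
        return LRNumber(1.0 / self.m,
                        self.beta / self.m ** 2,
                        self.alpha / self.m ** 2)

    def __mul__(self, other):
        m, a, b = self.m, self.alpha, self.beta
        n, g, d = other.m, other.alpha, other.beta
        if m > 0 and n > 0:                       # Case A
            return LRNumber(m * n, m * g + n * a, m * d + n * b)
        if m < 0 and n > 0:                       # Case B
            return LRNumber(m * n, n * a - m * d, n * b - m * g)
        if m < 0 and n < 0:                       # Case C
            return LRNumber(m * n, -n * b - m * d, -n * a - m * g)
        if m > 0 and n < 0:                       # assumption: commutativity
            return other * self
        raise ValueError("modal values must be non-zero")

    def __truediv__(self, other):
        return self * other.inverse()


if __name__ == "__main__":
    M = LRNumber(2.0, 0.5, 1.0)
    N = LRNumber(3.0, 1.0, 0.5)
    print(M + N, M - N, M * N, M / LRNumber(2.0, 1.0, 1.0))
```

A useful consistency check is that `(-M) * N` equals `-(M * N)` and `(-M) * (-N)` equals `M * N`, which is what Cases B and C express.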

A.2 Method of Fuzzification 

The method of fuzzification used in the present work is a simple one: the standard 
deviation of all the samples of each variable is used as the spread of the fuzzy number 
of that variable. The spread can be multiplied by any numeric constant in order to 
weight it. The fuzzified number of any variable $x$ is given as 
$X = (x, \alpha\sigma, \beta\sigma)$, where $\alpha$ and $\beta$ are constants and $\sigma$ is the standard deviation 
of $x$ in the observed samples. 
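A minimal sketch of this fuzzification step is given below. The function name `fuzzify` and the use of the sample standard deviation (`statistics.stdev`, dividing by n - 1) are assumptions; the text does not say whether the sample or the population form is meant.

```python
import statistics


def fuzzify(samples, a=1.0, b=1.0):
    """Turn crisp samples of one variable into (x, a*sigma, b*sigma) triads.

    a and b are the weighting constants for the left and right spreads;
    sigma is the standard deviation of the variable over all samples
    (sample form assumed here).
    """
    sigma = statistics.stdev(samples)
    return [(x, a * sigma, b * sigma) for x in samples]


if __name__ == "__main__":
    # Hypothetical observations of a single variable.
    for triad in fuzzify([1.0, 2.0, 3.0, 4.0, 5.0], a=1.0, b=0.5):
        print(triad)
```

Every sample of a variable receives the same spreads, so the constants a and b are the only handles for making the fuzzy numbers wider or asymmetric.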

A.3 Method of Defuzzification 

The crisp value of a fuzzy number is evaluated by defuzzifying the corresponding 
fuzzy number with a suitable defuzzification technique. The present work applies the 
standard center of gravity method to defuzzify the fuzzy number. 
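The center of gravity method can be sketched as follows. The example assumes linear (triangular) shape functions, which the text does not specify. The helper names `triangular`, `center_of_gravity` and `cog_triangular` are hypothetical. The numerical version uses a simple midpoint rule; for a triangular number the closed form m + (beta - alpha)/3 serves as a check.

```python
def triangular(m, alpha, beta):
    """Membership function of a triangular L-R number (assumed linear shapes)."""
    def mu(x):
        if x <= m:
            return max(0.0, 1.0 - (m - x) / alpha)
        return max(0.0, 1.0 - (x - m) / beta)
    return mu


def center_of_gravity(mu, lo, hi, steps=10000):
    """Approximate the centroid of mu over [lo, hi] with the midpoint rule."""
    h = (hi - lo) / steps
    num = den = 0.0
    for i in range(steps):
        x = lo + (i + 0.5) * h
        w = mu(x)
        num += x * w      # accumulates the integral of x * mu(x)
        den += w          # accumulates the integral of mu(x)
    return num / den


def cog_triangular(m, alpha, beta):
    """Closed-form centroid of a triangle with vertices m - alpha, m, m + beta."""
    return m + (beta - alpha) / 3.0


if __name__ == "__main__":
    m, alpha, beta = 5.0, 2.0, 3.0
    mu = triangular(m, alpha, beta)
    print(center_of_gravity(mu, m - alpha, m + beta))
    print(cog_triangular(m, alpha, beta))
```

For a symmetric number (alpha equal to beta) the crisp value is simply the modal value m; asymmetric spreads shift it toward the wider side.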
Table 1.4: Data sheet for estimation of life of converter lining problem (median, R&D) 

cr = cluster wise regression, fc = fuzzy c-means clustering, v1, v2, v3, v4, v5 = variation 1, 2, 3, 4, 5, pol, sin, tnh, exp = polynomial, sin, tan hyperbolic, 
exponential modeling functions, nc = no. clusters, rd = data prepared using mean with SAIL R&D's experience, mn error = mean error, ms error = mean squared error, 
rms error = root mean squared error, error std = standard deviation of error, max error = maximum error, min error = minimum error 

cr = cluster wise regression, fc = fuzzy c-means clustering, km = k-means clustering, v1, v2, v3, v4, v5 = variation 1, 2, 3, 4, 5, pol, sin, tnh, exp = polynomial, sin, tan hyperbolic, 
exponential modeling functions, nc = no. clusters, ica = data prepared using ICA, mn error = mean error, ms error = mean squared error, 
rms error = root mean squared error, error std = standard deviation of error, max error = maximum error, min error = minimum error 

cr = cluster wise regression, km = k-means clustering, v1, v2, v3, v4, v5 = variation 1, 2, 3, 4, 5, pol, sin, tnh, exp = polynomial, sin, tan hyperbolic, 
exponential modeling functions, nc = no. clusters, ica = data prepared using ICA, mn error = mean error, ms error = mean squared error, 
rms error = root mean squared error, error std = standard deviation of error, max error = maximum error, min error = minimum error 

exponential modeling functions, nc = no. clusters, ica = data prepared using ICA, mn error = mean error, ms error = mean squared error, 
rms error = root mean squared error, error std = standard deviation of error, max error = maximum error, min error = minimum error 

cr = cluster wise regression, at = ART2, fa = fuzzy ART clustering, v1, v2, v3, v4, v5 = variation 1, 2, 3, 4, 5, pol, sin, tnh, exp = polynomial, sin, tan hyperbolic, 
exponential modeling functions, nc = no. clusters, ica = data prepared using ICA, mn error = mean error, ms error = mean squared error, 
rms error = root mean squared error, error std = standard deviation of error, max error = maximum error, min error = minimum error 





Table A: continued 

oclstr = ortho-clustering, fc = fuzzy c-means clustering, km = k-means clustering, e1 = epsilon(0.001), e2 = epsilon(0.005), e3 = epsilon(0.01), e4 = epsilon(0.05), e5 = epsilon(0.1), 
nc = no. clusters, ica = data prepared using ICA, mn error = mean error, ms error = mean squared error, 
rms error = root mean squared error, error std = standard deviation of error, max error = maximum error, min error = minimum error 

Table A: continued 

oclstr = ortho-clustering, kh = SOM clustering, km = k-means clustering, e1 = epsilon(0.001), e2 = epsilon(0.005), e3 = epsilon(0.01), e4 = epsilon(0.05), e5 = epsilon(0.1), 
nc = no. clusters, ica = data prepared using ICA, mn error = mean error, ms error = mean squared error, 
rms error = root mean squared error, error std = standard deviation of error, max error = maximum error, min error = minimum error 

cr = cluster wise regression, fc = fuzzy c-means clustering, v1, v2, v3, v4, v5 = variation 1, 2, 3, 4, 5, pol, sin, tnh, exp = polynomial, sin, tan hyperbolic, 
exponential modeling functions, nc = no. clusters, md = data prepared using median with SAIL R&D's experience, mn error = mean error, ms error = mean squared error, 
rms error = root mean squared error, error std = standard deviation of error, max error = maximum error, min error = minimum error 

cr = cluster wise regression, fc = fuzzy c-means clustering, v1, v2, v3, v4, v5 = variation 1, 2, 3, 4, 5, pol, sin, tnh, exp = polynomial, sin, tan hyperbolic, 
exponential modeling functions, nc = no. clusters, md = data prepared using median with SAIL R&D's experience, mn error = mean error, ms error = mean squared error, 
rms error = root mean squared error, error std = standard deviation of error, max error = maximum error, min error = minimum error 

cr = cluster wise regression, km = k-means clustering, v1, v2, v3, v4, v5 = variation 1, 2, 3, 4, 5, pol, sin, tnh, exp = polynomial, sin, tan hyperbolic, 
exponential modeling functions, nc = no. clusters, md = data prepared using median with SAIL R&D's experience, mn error = mean error, ms error = mean squared error, 
rms error = root mean squared error, error std = standard deviation of error, max error = maximum error, min error = minimum error 

cr = cluster wise regression, kh = SOM clustering, v1, v2, v3, v4, v5 = variation 1, 2, 3, 4, 5, pol, sin, tnh, exp = polynomial, sin, tan hyperbolic, 
exponential modeling functions, nc = no. clusters, md = data prepared using median with SAIL R&D's experience, mn error = mean error, ms error = mean squared error, 
rms error = root mean squared error, error std = standard deviation of error, max error = maximum error, min error = minimum error 

nc = no. clusters, md = data prepared using median with SAIL R&D's experience, mn error = mean error, ms error = mean squared error, 
rms error = root mean squared error, error std = standard deviation of error, max error = maximum error, min error = minimum error 

simplregr = simple regression, cr = cluster wise regression, fc = fuzzy c-means clustering, v1, v2, v3, v4, v5 = variation 1, 2, 3, 4, 5, pol, sin, tnh, exp = polynomial, sin, tan hyperbolic, 
exponential modeling functions, nc = no. clusters, rd = data prepared using mean with SAIL R&D's experience, mn error = mean error, ms error = mean squared error, 
rms error = root mean squared error, error std = standard deviation of error, max error = maximum error, min error = minimum error 



exponential modeling functions, nc = no. clusters, rd = data prepared using mean with SAIL R&D's experience, mn error = mean error, ms error = mean squared error, 
rms error = root mean squared error, error std = standard deviation of error, max error = maximum error, min error = minimum error 

cr = cluster wise regression, fc = fuzzy c-means clustering, km = k-means clustering, v1, v2, v3, v4, v5 = variation 1, 2, 3, 4, 5, pol, sin, tnh, exp = polynomial, sin, tan hyperbolic, 
exponential modeling functions, nc = no. clusters, rd = data prepared using mean with SAIL R&D's experience, mn error = mean error, ms error = mean squared error, 
rms error = root mean squared error, error std = standard deviation of error, max error = maximum error, min error = minimum error 


cr = cluster wise regression, kh = SOM clustering, v1, v2, v3, v4, v5 = variation 1, 2, 3, 4, 5, pol, sin, tnh, exp = polynomial, sin, tan hyperbolic, 
exponential modeling functions, nc = no. clusters, rd = data prepared using mean with SAIL R&D's experience, mn error = mean error, ms error = mean squared error, 
rms error = root mean squared error, error std = standard deviation of error, max error = maximum error, min error = minimum error 

Table C: continued 

oclstr = ortho-clustering, kh = SOM clustering, km = k-means clustering, e1 = epsilon(0.001), e2 = epsilon(0.005), e3 = epsilon(0.01), e4 = epsilon(0.05), e5 = epsilon(0.1), 
nc = no. clusters, ica = data prepared using ICA, mn error = mean error, ms error = mean squared error, 
rms error = root mean squared error, error std = standard deviation of error, max error = maximum error, min error = minimum error 

Table C: continued 

oclstr = ortho-clustering, kh = SOM clustering, at = ART2, fa = fuzzy ART, e1 = epsilon(0.001), e2 = epsilon(0.005), e3 = epsilon(0.01), e4 = epsilon(0.05), e5 = epsilon(0.1), 
autregrma = ARMA, v1, v2, v3, v4, v5 = variation 1, 2, 3, 4, 5, pol, sin, tnh, exp = polynomial, sin, tan hyperbolic, exponential modeling functions, 
nc = no. clusters, ica = data prepared using ICA, mn error = mean error, ms error = mean squared error, 
rms error = root mean squared error, error std = standard deviation of error, max error = maximum error, min error = minimum error 

Table D: continued 

oclstr = ortho-clustering, kh = SOM clustering, km = k-means clustering, e1 = epsilon(0.001), e2 = epsilon(0.005), e3 = epsilon(0.01), e4 = epsilon(0.05), e5 = epsilon(0.1), 
nc = no. clusters, ica = data prepared using ICA, mn error = mean error, ms error = mean squared error, 
rms error = root mean squared error, error std = standard deviation of error, max error = maximum error, min error = minimum error 

cr = cluster wise regression, kh = SOM clustering, at = ART2 clustering, v1, v2, v3, v4, v5 = variation 1, 2, 3, 4, 5, pol, sin, tnh, exp = polynomial, sin, tan hyperbolic, 
exponential modeling functions, nc = no. clusters, box = model of Box gas furnace, mn error = mean error, ms error = mean squared error, 
rms error = root mean squared error, error std = standard deviation of error, max error = maximum error, min error = minimum error 

Fig 3.1 Evaluation of Sugeno method on each problem 